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PREFACE 


The present volume deals with variational methods for the optimization of motion of aircraft and other 
objects often encountered in modern technology. The development of aeronautics and astronautics progres- 
sively focuses ever increasing attention on the determination of optimum flight programs characterized by 
ininimum time, minimum fuel consumption, maximum range, minimum cost, etc, Numerous publications 
which appeared in the USSR and in the West during the last 20—25 years deal with various aspects of this 
optimization problem, The leading contributions were made by I, V. Ostoslavskii, D,E.Okhotsimskii, A. A. 
Kosmodem' yanskii, and A, Miele, Their work played an important role in the development of controlled 
flight mechanics; they were also the first to come face to face with the fundamental difficulties that to this 
date plague us in the’ solution of modern optimal problems, These difficulties are a result of the complex 
and varied conditions that must be taken into consideration when formulating and solving the various problems, 
The following factors fall under this category: 

1) various inequality restraints are imposed on the system variables by physical and practical considerations 
(restraints on altitude and velocity, on the overload, angle of attack, engine thrust, etc.); 

2) one is required to determine the absolute minimum, rather than a local (or relative) minimum; 

3) the sought optimal control often does not exist in the class of admissible controls in the classical for~ 
mulation of the problem or, in more general terms, we have to deal with degenerate problems, 

The above features are characteristic of all optimal control problems, and are not specific to flight 
dynamics, The difficulties can be overcome in two ways, The first approach called for modernizing the 
standard classical methods (transformation of variables, introduction of additional variables, direct analysis 
of variations), The second approach involved more radical measures, i,e., development of new optimal 
principles with logic not affected by the above considerations, 

It is this second approach that produced the new optimization methods due to R. Bellman (dynamic 
programming), L.S, Pontryagin (the maximum principle), and the variational methods which constitute the 
basis of our book. 

The monograph presents a systematic review of the authors’ results obtained over a number of years, It 
is divided into six chapters and a Supplement, Chapter I reviews the elements of the theory of variational 
methods, Unlike the traditional approach, the variational technique also covers the case when the sought 
optimal control does not exist in the class of admissible controls and the solution must be derived by construct- 
ing a minimizing sequence, This is a typical case often encountered in modern practice. The fundamental 
theorem of the sufficient conditions of the absolute minimum of a functional is then applied to reduce the 
problem of functional minimization to the problem of maximum of some function R of the problem variables 
for every fixed value of the argument, 

Chapter II describes in detail a number of particular methods, The equations of Pontryagin's maximum 
principle are derived in § 2,1, and the reader is acquainted with this method from a new angle, These 
equations are then reinforced with new conditions of a strong local (relative) minimum, Significantly, these 
conditions are linked with the classical sufficient conditions of variational calculus, Jacobi's famous condition 
is seen to be equivalent to the requirement of negative-definiteness of the second differential of the function R, 

The important concept of the synthesis of optimal control is also dealt with in this chapter, Bellman's 
equation is derived as a particular case from the conditions of the fundamental theorem, and the different forms 
of solution of the optimal problem — synthesis and program — are briefly illustrated, The complete solution 
for the synthesis of linear systems is given, The method of approximate synthesis and, primarily, the a priori 
upper-bound estimate of the solution accuracy are of the greatest practical importance among these topics, 
New methods of this kind are of great value, since the Bellman equation often proves too complicated for 
practical application and its exact solution too problematic, 

Chapter III considers the structure and the analysis of degenerate and sliding controls. The treatment 
starts with a description of the construction of a sliding-control minimizing sequence; the construction is 


vii 


first described for a simple example and then in a more general form, depending on the properties of the 
function R, The methods of Pontryagin and Bellman are then generalized to the case of degenerate solutions, 
The last sections of the chapter present a new special method particularly suitable for degenerate and sliding 
controls, the method of multiple maxima, This method is highly effective for flight dynamic problems. 

Also of considerable interest is the theoretical application of this method to the investigation of the 
degenerate second variation of the functional, This approach leads to a natural generalization of all the 
standard conditions of variational calculus, including Jacobi's global condition, to the case of degenerate 
solutions, 

This method is still in the development stage, but the results summarized in this book are quite sufficient 
to illustrate its great potential, in particular, for qualitative analysis of various problems with the purpose of 
deriving "quick" estimates, This is a highly valuable attribute at the design and synthesis stage, when exact 
solutions are meaningless, insofar as the system parameters have not been fixed, 

The next two chapters deal with applications, Chapter IV presents solutions of a number of problems 
of powered flight dynamics, First to be considered is the elementary problem of the vertical ascent (descent) 
of arocket in vacuum. The well-known solution of this problem is derived here as an illustration of the 
applications of the new methods (here, as in most other applications, the method of multiple maxima is 
employed), The general problem of rocket dynamics in a homogencous field in a vacuum is then considered, 
The application of the new method leads to an original treatment of this problem and yields new simple 
equations of optimal motion, with coasting integrals playing the role of variables and the direction of thrust 
and time figuring as controls, 

Next to be considered is the so-called classical problem of flight dynamics, e.g., the problem of powered 
ascent of an aircraft, It was previously solved by different methods by Ostoslavskii, Egorov, and Miele, The 
application of our new method yields a more accurate result; in the case of several extremals (e,g,, the 
typical situation in supersonic flight) the transition points between the different extremals are naturally 
determined. 

The last section of chapter IV illustrates a detailed construction of an approximate synthesis of optimal 
aircraft control on the ascent section, The synthesis of optimal control is the most desirable form of solution 
of the optimal problem, but it is more time- and labor-consuming than the construction of the optimal 
program, This feature is probably linked with the infrequent use of synthesis solutions in nonlinear systems, 

Chapter V investigates optimal coasting controls of winged aircraft subjected to lift, Minimum-time 
and minimum-heat descent programs are discussed, using various characteristic restraints (on altitude, angle 
of attack, temperature, overload, etc.), The method of multiple maxima is again very effective in this 
case, as it yields approximate optimal solutions (with detailed accuracy estimates) in the form of finite ex- 
pressions, without involving the engineer in a tedious and difficult solution of boundary-value problems. 

Note that the sought optimal control is found to bea sliding control for the angle of attack over a considerable 
length of the trajectory; in practice, it can be realized by a relay control with a moderate switching frequency, 

Chapter VI presents an analog of the optimum principle for discrete control systems described by finite- 
difference equations. Although the treatment is purely theoretical, its practical value is self-evident in the 
light of ever increasing coinputer applications, Optimal problems using differential equations are converted 
into finite-difference schemes for computer solution, 

The Supplement at the end of the book is fundamentally different from the earlier chapters, It will be 
of interest to readers who wish to acquaint themselves with the basic problems of variational calculus and 
with the fundamental ideas that served as a point of departure for this monograph. The Supplement also 
contains a useful solution of the problem of maximum-range horizontal flight of an aircraft, 

The authors acknowledge the great help of Acad. A.M. Letov, who reviewed the manuscript and made 
a number of valuable comments, The authors are also grateful to the scientific editor of the book, I, V. 
Ioslovich, for his efforts. 

Reader’ s comments will be most welcome, Please address all correspondence to "Mashinostroenie" 
Publishing House, Moscow, K-51, Petrovka 24, 
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INTRODUCTION 


Various problems of science and technology often require choosing 
the best, or the optimum, solution among a set of all the possible 
alternative solutions. The mathematical formalism of problems of this 
kind generally operates with two concepts: the concept of a set and the 
concept of a functional defined over a given set. Examples of sets 
are the set of all real numbers between 0 and 1, the set of all possible 
directions of the thrust vector of an aircraft engine, the set of all possible 
trajectories taking an aircraft to a given altitude, etc. 

A functional is said to be defined over some set if we know how to 
assign a definite number to each element of the set. 

Consider some simple examples of functionals. 

1. Consider the set of all plane curves. To each curve we assign a 
certain number — its length. The length of a curve is thus functional. 

2. Consider a point in the (x, y) plane which may move along any path 
between two given points A and B so that it has a definite velocity V(x, y) 
at every point (x, y) of the path. Assigning to each path of the set the time 
to traverse that path, we obtain a functional. 

3. The maximum surface temperature of a spacecraft during atmo- 
spheric reentry is a functional which depends on the trajectory of the 
spacecraft. 

The general formalism of choosing optimum solutions (according to 
certain "quality" criteria) operates with a fixed set M and a functional 
I(v), ve M, defined over the elements of this set which characterizes the 
quality of the element v, so that if an element y is "better" than an 
element v2 in a certain sense, /(v,)</(v2) and vice versa. The problem 
is thus formulated as follows: find an element uv of the set M for which the 
functional /(v) has its minimum value /(v)=m,. This problem, however, 
is not always solvable. The set being considered does not always contain 
an element v satisfying the exact equality /(v)=m. For example, the 
function y=ax+b, a>0, attains its minimum at the point x=0 on the set of 
the points *« between 0 and 1, provided this set is closed, 0<x<1; the 
minimum is not attained on any point of the set if itis open, 0<x<1l. A 
slight modification in the statement of the problem eliminates the difficulty. 
The final formulation reads as follows: consider a set M and a functional 
/(v)on this set, vCM. Find a sequence of elements of the set M, 
{v;}CM, for which I(v,)—-m for s—-co. The resulting sequence is known 
asa minimizing sequence. It always exists, in virtue of the 
definition of the exact lower bound m=int f . The minimal element 7 may 


be treated as one of such minimizing sequences defined in the form v,=v 
for all s. 


In what follows, we will deal with more tangible problems: determination 
of the optimal control for objects described by ordinary differential 
equations or by their discrete analogs — the finite-difference equations. 

As a preliminary step, we have to consider in some detail the concept of a 
controlled system (or controlled object). 

A typical example of a controlled system is provided by an aircraft. 

The vector equations of motion of the center of mass of an aircraft in 
an inertial frame of reference have the form 


mV =F; 
P2y) (I. 1) 
a= — Br, 


where ms V are respectively the radius vector and the velocity vector of the 
center of mass of the aircraft; mis the mass; f;is the mass (propellant) 
consumption per second; Fis the resultant vector of all the external forces. 

When solving the equations of flight dynamics, the motion of an aircraft 
is generally considered in the wind system of coordinates /4/: 


V=— [Pcos(a—4)cosB— X cosB-+ Z sinB-— 
m 

—G sin 4); 

j=—_{P [sin(e—g) cos y-+cos(a—g)sinBsiny,J— 

m 

—XsinBsiny, +Y cos y,— Z cosBsiny, —G cos 9}; (1. 2) 

. 1 : : 

le are {P[sin(a— 9) sin y, — cos (a— 

—)sinBcosy,]+.¥ sinBcosy,+Y siny, + 

+ ZcosB cos y,}; 

h=V sin0; m= —®6;: x=V cos, ) 


where V is the velocity; A the altitude; x the range on the Earth's surface; 
0 the angle of inclination of the trajectory to the local horizon; a the angle 
of attack; the angle of yaw; the angle of side slip; y,the angle of bank; 
@ the angle between the thrust vector and the velocity vector; P engine 
thrust; Gaircraft weight; X, Y, Zthe drag, the lift, and the side force, 
respectively: 


X=e,(M, a) £4" g. ¥=c,(M,a)2@" S: 


Z=c,(M,a) 20)" S, 


where o(/) is the density of the atmosphere; S is the effective wing area; 

M is the Mach number, equal to the ratio of the flight velocity to the velocity 
of sound a(f) at the given altitude; cx, cyand c,; are the aerodynamic 
coefficients. 


Equations (I. 2) relate two essentially different groups of variables. The 
variables h, V,6, pc, x and m enter (1.2) together with their first deriv- 
atives and thus characterize the state of the system at any time f¢; the 
number of these variables is equal to the order of the system. 

Such variables are generally known as phase coordinates. The 
variables a,f,yc,9, and B- enter (1.2) without their derivatives and thus 
act as free variables. They may be defined quite arbitrarily as certain 
functions of time, and they determine the solution of the system (the 
behavior of the controlled system) for a given initial state Mo, Vo, %, co, Mo. 
Variables of this kind are known as control elements orcontrols. 

The classification of the variables into phase coordinates and control 
elements is closely linked with the particular choice of the mathematical 
model of the controlled system. In some problems the mathematical 
model (system (I. 2)) provides an insufficiently accurate description of the 
actual behavior of the aircraft and it can be improved by supplementing it 
with the equation of the angular motion of the aircraft about its center of 
mass. In this system, the variables a, B and ye become phase 
coordinates, and the rudder deflection angles assume the role of control 
elements. On the other hand, in some problems, certain phase coordinates 
may be upgraded to the status of control elements without any detrimental 
effects; this would involve dropping the corresponding differential equations 
from the mathematical model. This approach was actually applied by some 
authors /4,8/ in solving the problems of powered ascent of aircraft, when 
the trajectory inclination angle 0 was used as a control element. 

In practical problems, besides differential equations we have to consider 
further a variety of constraints and conditions on the variables which stem 
from the particular properties of the controlled system. The following 
typical constraints are imposed on aircraft flying in the denser atmospheric 
layers: the altitude h>0, the angle of attack amin <aSamax, the dynamic 


head g='/29V?<qmax» the total overload N= > V X2(h,Va) FY? (V0) < 


SN max, the surface temperature 7,(h, V, a)<Tumax. Certain additional 
conditions are also imposed on the initial and the final state of the object. 
Thus, a vehicle is intended to transport some payload from the point of 
origin ho, X» on the ground, where it was at rest (V)>=0), toacircular orbit 
at altitude 4,(0:=0, V:=Vu,). The different control programs satisfying all 
these requirements will be referred toas admissible alternatives. 
The set of all admissible control programs D maybe identified with the 
set M inthe general formulation. We will consider integral 
functionals defined over the set D: these are integrals taken between 
certain time limits over some functions of the phase coordinates and control 
elements or functions of initial and final states, or linear combinations of 
these states. Examples of such functionals for aircraft are the flight range 


ty 
ra Vcos@dt, the flight duration t,—f , fuel consumption my—/m, the final 


altitude h,, the final velocity Vi, etc. 

We are thus dealing with the following general problem. 

Consider a system of differential equations describing a controlled 
system, 


g=F(t y, 4), (I. 3) 


where y=(y', y’,...,y") is the n-dimensional vector of the phase coor- 
dinates, u=(u', u®,..., ur) is the r-dimensional vector of the control 


elements. 
The variables y, u for every ¢ may take on certain values from some set 


V(t) defined by the additional constraints 
(y, u) C V(t). (I. 4) 
The projection of this set onto the space of the phase coordinates Y at 
every ¢ will be designated V,(t), y€ V,(t), and its cross section for any ? and 


y will be designated V.i(4,y),”CVu(t, y). Expression (I. 4) incorporates, in 
particular, the boundary conditions 


YyoCVylto), yr C Vy (tr), 


where y(to)=Yo, y(t) =. 
In the set D of the pairs of vector functions y(t), u(t) satisfying the above 


conditions, find a sequence {j,(t), us(t) } over which the functional 


t, 
f=f P(e, y, w)at + F (yo. m1) (1.5) 
to 


goes to its exact lower bound in the set, /(j.(¢), @s(t))—ini/. In particular, 


if we assume that the least value of the function / is attained on some 
element y(t), u(t) of class D, our problem is to find this element: 


1(y(), uw (£))= int /, (I. 6) 


Let us now consider briefly the generally accepted methods of solving 
problems of this kind. 


Pontryagin's maximum principle 


Let, for simplicity, the set V, be constant (independent of ¢ and y ), the 
boundary conditions fo, 4), Yo, 4: fixed, the set V,(t) for any t€(to, t;) coincide with 
the space Y (there are noconstraints on the phase coordinates), and F (yo, yi) =0. 

To solve the problem, we define a function of (2n+r+2) variables (the 
problem Hamiltonian) 


H(t, spo, PY, 4) = Spi (t y 4) + oP (ty, u), (1. 7) 


where P= (1, 2, -.., pa) iS an mn -dimensional vector, the first term on the 
right is a scalar product of n-dimensional vectors. The so-called adjoint 
system of n differential equations is then added to the starting equations: 


. oH 
arr (I, 8) 


The maximum principle is expressed by the following theorem. 
Theorem I.1. Let (f(t). uw(t)) ED be a solution of the problem, i.e., 
the point of D minimizing the functional /. Then there exists a vector 
(bo, (t)) which is not identically zero, where w(t) is the solution of (I. 8) 
for y=7(t), u=U(t), po is anonpositive constant, such that for all /€ [to, ¢,] the 
function H(t, y(t), wo, p(t), u) attains its absolute maximum on V, for u=i: 


H(t, y Ost, ¥(4), (= sup H(t, Y(t), Yo, PO, =P (1. 9) 


and the function «(‘) is continuous over [to, f,] . 

The proof of this fundamental theorem will be found in /6, 2, 3/. 

Equations (I. 8)—(I. 9) are the necessary conditions for an optimum 
solution y(t), z(t). Together with equations (I. 3), they constitute a system 
of ordinary differential equations of order 2n, closed by the finite relation 
(1. 9); on a given segment [fo, f] the solution of this system (y(t), p(t), u(t)) 
should satisfy 2n boundary conditions which constrain the phase path y(f) to 
pass through given points yo, y, at times ft) and ¢; , respectively. 
In other words, the maximum principle reduces the optimization problem 
to a boundary-value problem for a system of ordinary differential equations. 
The solution of this boundary-value problem, in general, may prove to 
be different from the sought optimum solution, since the equations of the 
maximum principle provide only the necessary conditions of optimality. 
However, if we can be certain that, first, the minimum of the functional 
exists (in the set D ) and, second, the solution of the boundary-value 
problem is unique, the solution obtained by this method is indeed the 
optimum solution. 

The maximum principle generalizes the classical optimum conditions 
— the Euler—Lagrange equations and Weierstrass's condition — to the case 
of a closed and bounded set V, (these are the conditions mostly encountered 
in practice). If the set V, coincides with the entire space U, we have 
from (I. 9) 


Au(t,y, wp, %) =0, (I. 10) 


where Hy=(Hw, Hu, ..., H,r) is the vector of the partial derivatives of 

the function H with respect to the components of the vector u. Equations 

(1. 3), (1.8), and (1,10) form a system of canonical Euler—Lagrange equations 
for the functions y(ft), p(f), u(t). The functions po, pi(t), e(t), ---, a(t) 
coincide in this case with the Lagrange multipliers, and relation (I. 9) 
coincides with Weierstrass's necessary condition. Indeed, from the 
definition of the supremum, (I. 9) implies that 


H(t, 9, p, @)—H(t, J, yp, 4) DO, (1.11) 


for any u€ Vu. 

In general, when the set V, is closed, the point at which the function 
H(u) reaches its maximum need not be a stationary point of the function, so 
that the Euler—Lagrange equations (1.10) are not always the necessary 
conditions of optimum. 

Note that various transformations mapping the set V, onto some open 
set, which are widely used by various authors (e.g., /7—9/) in conjunction 
with the apparatus of variational calculus, lead to the equations of the 
maximum principle (1.3), (1.8), (1.10), if their application is valid. 


Example I,1. As an example, let us consider the problem of the 
shortest path between two given points A and B. The set D of all the ad- 
missible paths is limited by additional conditions requiring single-valuedness 
of y(x), z(x) on (Xa, Xs). The functional and the equations describing the 
set D then take the form 


*B 
1=f Viewer de, (1. 12) 
A 


» dy . 2 

Y=, TE2=7 = Ui (1. 13) 
Y(Xa)= Yar Y(Xa)= Ye! (1. 14) 
Z(Xq)=Zyy Z(Xg)— Zz. (I. 15) 


Here the rectangular coordinates y and z are the phase coordinates 
of the problem, and the derivatives of the functions y(x) and z(x) are the 
control elements. 

The vector should also have two components. We designate them jp, 
and p,, respectively : p= (wy, pz). 

By /6/, we have 


Hl (X,%s Yor Per Ys Zs hy Vy + 4,0 +b VIF +22, 


i.e., the function H depends only on the vector and the control elements 
u, v. Equations (I. 8) thus take the form 


py =0; p.=0, 
whence it follows that the components of the vector are constant: 
‘py =const; .=const. 


Let us investigate the maximum of H with respect to u. For yo=0, 

H has a maximum only for p,=p.=0. The trivial solution po=yy,=.=0 
does not satisfy the maximum principle, and we therefore have to take 
Yo*xO. For any given x, po<0, py, p, the function H (u, v) has a single 
point of maximum, u,%, which is independent of x. 

Using equations (I. 8), we find that the shortest path is the line of constant 
slope, i.e., a straight line; since only one straight line can be passed 
through two given points, the shortest path is unique. Since the shortest 
path between two points certainly exists, we conclude that our solution 
indeed gives the shortest path — the straight line joining the points A and B. 


Beliman's dynamic programming method 


Consider a particular case of the general problem, when the initial 
values of ¢ and y are fixed, the final value of ¢ is given, but no constraints 
are imposed on the final value of y. Moreover, there are no constraints 


on the phase vector in the entire interval (fp, f;), so that the set V,(t) is 
an open domain. 

A suitable example is the optimal homing of a liquid- propellant rocket 
on a free target in vacuum. Constant engine thrust is assumed, but its 
direction can be altered at will. Ifthe differences in gravitational 
acceleration on the two objects are ignored, the homing equations relative 
to the free target may be written in the form 


(1. 16) 


where rr, V are the radius-vector and the velocity vector, pis the unit 
vector in the direction of the thrust (a control element), a(¢)is the thrust- 
produced acceleration (a,known function of time). The initial position ro 
and the initial velocity V, of the homing rocket are known. The problem 
is to minimize the square of the distance to the target r? at the final time 
t=t, - 

In tackling problems of this kind, Bellman's method uses a self-evident 
optimum principle /1/. 

Optimal behavior has the property that, whatever the initial state and the 
initial solution, subsequent solutions constitute an optimum in relation to 
the state ensured by the original solution. In other words: any component 
of an optimal process is an optimal process in relation to the current 
(instantaneous) state. 

The sought optimal solution and the minimum value of the functional 
clearly depend only on the initial time f and the initial state y. We thus 
have a scalar function S(t, y) defining the minimum value of the functional 
for given ¢t, y, which are treated as the initial values of the problem. In 
our example, this scalar function is the square of the minimum distance 
to the target at the end of the homing run, assuming that the homing 
missile was launched at the time ¢ from point r with velocity V. The 
function S(t, r, V) in our example is thus defined as 


S(t,7,V) = 7, (1.17) 


The optimum principle is expressed by the following functional equation: 


S(t, le [i P(e y, nat+scean), (I. 18) 
u t 


where {u} is the set of the admissible values of the control elements; tis 
some time from the interval {f, 4]; y(t) is the solution of (I. 3) in the time 
interval [f, t] with the control element u(f) and the initial conditions /, y; 
yx =y(t) is the value of this solution for ‘=1t. Allowing t to goto ¢ and 
assuming continuity and differentiability of S(t, y), we readily find the 
following partial differential equation: 


9S 4 ing (SF yw+sUt y,u)}=0, (I. 19) 
ot re oy 


with boundary condition (1.17). When this equation is solved, we obtain 
the function S(t, y); the optimum control element at any point (f, y) is found 
by minimizing the expression in braces. 

It is readily seen that in this way we not only find the optimum program 
for given initial values f, yo, but in fact solve a more general problem of 
the optimum behavior for any pair of values 4, y, treated as the initial 
values. This type of solution is known as optimum control 


synthesis. 
In our particular example, Bellman's equation (I.19) takes the form 


S48 easel eee pa (\=0. (1. 20) 
ov 

Seeing that a(¢)>0 and the scalar product . p is minimum when the 

unit vector P is antiparallel to s, 
pa a les (1. 21) 

we rewrite (I. 20) in the form 
SN = 28 la 0, (1. 22) 
or ov 


This is a nonlinear partial differential equation of first order. Its solution 
with boundary condition (I.17) gives a certain function S(¢,r, V). Expression 
(I. 21) then defines the field of optimum control elements, i.e., the optimum 
direction of the thrust vector for any state J, r, V. 

Like the maximum principle, Bellman's equation has its classical 
analog — the Hamilton—Jacobi equation of classical mechanics which differs 
from (1.19) in that u is obtained from the condition 


a fa SE Ot SUE yr a)} =O, (I. 23) 


Ou 


and not by minimizing the expression in braces. 

Equation (I. 19) is evidently more general and it essentially coincides 
with the Hamilton—Jacobi equation in cases when the given expression attains 
its minimum at a unique stationary point. 

Both methods — the maximum principle and Bellman's dynamic 
programming method — are widely used because they are organically linked 
with the particular features of modern applied problems and allow ina 
straightforward manner for the physical constraints imposed on the various 
elements. A wide range of optimum control problems have been solved by 
these methods. 

There exists another extensive class of problems (including typical 
flight dynamics problems), however, which are very difficult and often 
impossible to tackle by these methods. 

Let us consider some simple examples of this kind. 


Example I.2. Consider the problem of the optimal conditions of the 
vertical ascent of a rocket launched from the surface of the Earth, which is 
required to reach a given altitude with minimum fuel consumption (or, 
equivalently, to reach a maximum altitude with a given fuel charge) /5/. 

The equations of motion of the rocket are 


h=V; (1, 24) 
Y= — AEM — p(n) +Pim; (1. 25) 
maak (1. 26) 

ce 


Here, h, V, m are respectively the altitude, the velocity, and the mass 
of the rocket; X(h, V)is the drag; g(h)is the gravitational acceleration; 
P is the engine thrust, controllable between the limits 0<P<Pyax; c¢ is the 
nozzle velocity of the combustion gases (constant). Let Mo, Vo, Mo, fo be the 
given initial values of the altitude, the velocity, the mass, and time; fy, ¢, 
are the final altitude and time. 

A characteristic feature of this problem is the linear dependence of the 
right-hand sides of equations (I. 24)—(I. 26) on the controlled thrust P. 

Let us analyze the problem using Pontryagin's maximum principle. 

Let 


H WribedtiVin P= bY +h (—— X(VI— e+—)—% P, 


(1.27) 
; OH . -_ OH. = 0H 
= on? y= ww ; = Om 


be the Hamiltonian and the adjoint system of equations, respectively. The 
optimum solution is then selected from among the solutions of the system of 
differential equations (I. 24)—(I. 27) closed by the additional relation 


H (1, bos $554, V, m, P) == max H (th, $2, 3, 4,V, m, P). (1. 28) 
0<P<P. 


max 


The sought solutions should satisfy the following boundary conditions: 
for t= ’ 
h=hy V=Vo; m=; 
for ¢=t,, 
h=h; tbo (£,)=0; A (hh, de, 3, h, V, m, P),. 1,=0 
(f)) > 0. 
Let us consider (1. 28) in more detail, Seeing that the function H depends 


linearly on the controlled thrust, we readily conclude that depending on the 
sign of the coefficient before P (the switching function), 


M=h2o/m—ps/c , (1. 29) 


this equation is satisfied by 


1) P=P,., for M>0 
2) P=0 for M<0 (I. 30) 
3) M=0 


In case 3, H is evidently independent of P, sothat any P may be chosen 
from {0, Pmax]. 

In our example, case 3 in (I. 30) may be identically satisfied over some 
time interval The corresponding solution in the theory of optimal processes 
is known as the singular control solution. 

The corresponding control function is obtained by making the finite 
equation M=0 consistent with system (I. 24)—(1. 27). 

Setting the total time derivative of M equal to zero, 


Na we oe Me 9; (1. 31) 


and inserting the right-hand sides of (1. 24)—(1. 27) together with the equality 
H=0, we obtain a new finite relation, which is independent of P: 


N=—|« (+-4)-*9 + £=0. (I. 32) 


Repeating the same operation for (1. 32), we obtain a linear equation for 
P, from which the sought control function is found. 

The construction of the optimum program generally involves the solution 
of a boundary-value problem for (I. 24)—(I. 28). As we have seen above, 
the rationale of the procedure based on the necessary conditions only is 
twofold: first, the existence of the sought optimum program must be 
established and, second, the uniqueness of the solution satisfying these 

necessary conditions should be proved. The 
existence of an optimum program for the 
problem being considered (the right~hand 
sides of the equations (I. 24) —(I. 26) are 
(1,,V;) linear functions of the control elements) 
follows from the general considerations 
of the theory of optimal processes. The 
uniqueness of the solution of a boundary- 
value problem for a nonlinear system of 
differential equations, on the other hand, 
is by no means certain. The existence of 
singular control markedly aggravates the 
situation. Indeed the singular control 
program is the set of switching points 
M=0, so that at each of these points we 
may take both P=0 and P=Pmay 
FIGURE I. 1 (Figure I. 1). 


Not every trajectory obtained in this way need be consistent with the 
boundary conditions, and there are in fact examples showing that there 
may be infinitely many such trajectories. 

We thus see that complete solution of problems of this kind requires 
sufficient or, at least, additional necessary conditions of optimum, which 
would enable us to eliminate all the incompatible solutions. 

The dynamic programming method in this case leads to a nonlinear 
partial differential equation 


os as os 1 
a oe Vor (Oe) + 


+ max ae Bho aot =) P]=0, 


om om ic 


OSPKP aay 
or 
BS v4+2(-5 X (HV) —B(H)) +4 (M)=0, (1. 33) 
where 
seyjaa) MP tee M = 2S + “> 0, 
0 for M<0O. 


No sufficiently general theorems of the existence of solutions for these 
equations (with a non-differentiable left-hand part) are available; nor are 
there regular methods for their solution (numerical or otherwise). 

We are thus faced with a contradiction: in those cases when the 
additional necessary and sufficient conditions of optimum are required for 
constructive purposes, they are in fact inapplicable. New effective 
optimum conditions of singular control thus have to be found. 

Example JI.3. Let us consider the same problem as in Example I. 1 
with one difference: the engine is not throttled, and it is either on or off 
(the thrust P may only take on two values, 0 and Pmax). In this case, as 
is readily seen, the equation M=0 is no longer compatible with equations 
(I. 24) —(1. 27) of the maximum principle, so that the singular control 
solution is not included among the allowed solutions of these equations. 
However, even if a unique solution has been obtained for the boundary-value 
problem of equations (I. 24)—(I. 27), we can by no means be certain that 
this solution is the optimum, as we do not know that the optimum solution 
exists among the admissible solutions. The problem is solved in this case by 
a minimizing sequence, providing a so-called sliding control program, It 
will be shown in what follows that a typical solution of this problem is a 
succession of thrust programs with infinitely increasing thrust switching 
frequency around the line M=0, N=0. 

We are thus faced with a new problem of finding appropriate minimizing 
sequences, 
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Problems of this class can be replaced by another problem, for which 
the optimum solution is known to exist. However, the minimizing sequence 
of the original problem corresponds to the singular contro] solution of the 
new problem, and in the final analysis the sliding control problem reduces 
to a singular control problem. Bellman's equation for this problem 
precisely coincides with equation (I. 33), with all its consequences. 

Example I.4. Suppose that the thrust in Example I. 2 does not have 
a maximum value (this is a typical situation in the mechanics of space 
flight). In this case, as is readily seen, both the function H and the left- 
hand side of Bellman's equation are infinite for positive M, so that neither 
the maximum principle nor the dynamic programming method are 
applicable in their original form. Moreover, with arbitrary boundary 
conditions, the admissible solutions of the problem do not include the 
optimum solution, so that we have to construct a minimizing sequence, 
although of a different type: a sequence allowing indefinite growth of the 
thrust. This is a typical feature of systems with unbounded linear control 
elements. In additiori to these fundamental difficulties, there are 
difficulties of secondary importance associated with the actual numerical 
solution of the problems. This remark primarily applies to Bellman's 
method. How are we to solve numerically equation (1.19), say? Consider 
the solution by the grid method, when we have to compute the partial 
derivatives of the function S(f,, y) at certain tabular points (the function is 
known for {t=t,). 

From equation (I. 19) we then find the values of 0S/d! at the same tabular 
points and approximate to the values of S for t=f,—A using the equality 


Sq—-M=S (4) - SEH a, 


The same procedure is then repeated for S(t,;—A, y), etc., until we 
reach fj. Consider the case of a midcourse maneuver in space. Then 
r, V are three-dimensional vectors, and S(#, r, V) is a function of seven 
variables. The numerical grid consists of ten values of each variable for 
every ¢. It thus comprises 10® points, at which the function S should be 
computed and the results stored. Such a tremendous volume of data 
cannot be stored in the immediate-access memory of most computers. 

And yet, this is a very modest number considering the required 
accuracy. If the grid spacing is reduced only by a factor of 2, this number 
will increase by a factor of 25= 64, so that even the most optimistic 
forecasts of the future development of computers can hardly catch up with 
this "growth". Suppose that equation (1.19) has been solved; it is still not 
clear, however, how we are to store and use the solution, which is a 
function of seven variables defined by its numerical values. Certain 
techniques can be applied in order to reduce this curse of dimensionality", 
to borrow Bellman's expression, but it cannot be eliminated completely, so 
that at the present stage nonlinear problems of third or fourth order 
apparently constitute the limit as far as Bellman's method is concerned. 

The maximum principle does not lead to such catastrophic effects, 
and it is much more powerful and more promising inthis respect. In view 
of the difficulties involved in the solution of boundary-value problems, we 
have to concentrate on developing alternative methods of optimal solution, 
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which will not require straightforward comparison of all the different 
alternatives satisfying the necessary optimality conditions. This is a highly 
important and as yet unsolved problem. 

A new approach to the solution of the general problem, considered in 
the next section, opens new possibilities for overcoming these fundamental 
difficulties. 


Optimum principle based on a reduction of the 
given problem to a trivial problem 


Consider the problem of minimizing the functional 
t, 
L(y, a= —[ RAY Bdt+® (Yo, 41) (I. 34) 
to 


on the set E of pairs of vector functions y(t), u(t), which differs from the 
set D only in that y(¢) and u(t) are not related by differential equations. 
This problem will be called a trivial problem. The following almost 
obvious optimum conditions apply to this problem. The functional /is 
minimized on the set E by the vector functions y(t), a(t) whose values at 
every interior point of the segment [f, ¢,] maximize the integrand R(t, y, u) 
over the set V(/) and the values at the integration limits fo, ¢; minimize the 
function @® (yo, y;) over the set Vy(fo) +Vy(t1): 


R(t, y, z)= sup Rit, y, wEC(fo, 4; (1. 35) 
(y, #)EV(E), 
® (iy 1) = inf P(yo, y). (I. 36) 
Yo EV y(to) 
yy evi) 


In other words, the problem of minimizing the functional over the set of 
functions y(f), u (f), i.e., over an infinite set of numbers, is thus reduced 
to finding the maximum of a function R (tf, y, uw) of nt+rvariables y', u* for 
every given ¢€(, ¢:) and the minimum of the function @M(y, y,) of 2n 
variables. 

These conditions, with slight reservations, are generalized to the case 
of a minimizing sequence {is(t), s(t)}. 

This sequence satisfies the conditions 


R (2, Ys, us) sup Rit, Y, u), tE(to, th): (1. 37) 
(y, weV (t) 
D (Ys (40, Ys inf (yo, yy). (1. 38) 
YN y (fo) 
wy (ft) 
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In view of the simplicity of these conditions, it is advisable to reduce 
the general variational problem with differential constraints to a trivial 
problem, i.e., we have to find the functions R(t, y, u) and ®(¥yo, y,) such 
that the solution of the trivial problem is also a solution of the original 
problem. This is the essence of the variational method considered in this 
book. 

Let us present, without proof, the basic result on which the variational 
principle devolves. The functions R and @ are sought in the following form: 


Rit y D= Delt y O—-P(h y, +e; (1. 39) 
i=) 
D (yo, WI=F (Yo. W+E(4 WI—9(fo, Yo) (1. 40) 


where 9,/, 9, are the partial derivatives of some function p(t, y) which is 
continuous and differentiable over V,(t) . 


Let 
w(j= sup R(Z, y, 2); (I. 41) 
4 ueV (1) 
m= inf P (yo, 41). (1. 42) 


Yo EVy (lod, WE Vy(ty) 


Theorem I.2. Consider a pair of vector functions y(t), w(t) from the 
set D and a function (¢, y), such that 


1) R(t, yd), 2 (4) =v(d); (1. 43) 
2) ®[y (4), y(t))=m. (I. 44) 


This pair y(t), Z(t) minimizes the functional /. The functions R(t, y, «) 
and @(yo, y:) mentioned above thus have the form (1.39), (1. 40) and are 
expressed in terms of the function g(t, y) of the n+1 arguments f, y', 
i=l, 2,...,, which satisfies the conditions of the theorem. 

In a more general case, when our problem is to find a minimizing 
sequence, Theorem I. 2 is formulated in a somewhat different form. 

Theorem I.3. Consider a sequence {js(t), @s(t)} from D anda 
function @, such that 


1) Rit, ys), AsO) > oO, £€ Gy, 4): (1. 45) 
2) ® [ys (bo), Ys (4) > m; (I. 46) 
3) p(t) is piecewise-continuous and m is a finite number; 

4) the sequence R{f, ¥s(t), ts(t)] is bounded. 


Then this sequence minimizes the functional /on the set D. Conditions 
1 and 3 may be replaced by a more general, though less obvious, condition 


J RUA ds), wD dt— | wate. (1. 47) 
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The main difficulty associated with the application of this principle is 
the definition of the function 9(#, y). 

The definition of @ essentially determines the method of solving the 
problem. We will see at a later stage that one of the possible definitions 
of @ leads, in particular, to the equations of Pontryagin's maximum 
principle as the necessary conditions for the maximum of the function R, 
while another method leads to Bellman's equations. These two alternatives, 
however, do not exhaust all the different possibilities. 

The arbitrariness inherent in the definition of @ may be put to work so 
as to devise the best procedure for each particular problem. 

Let us consider two simple examples which illustrate this point. 

Example 1.5. Minimize the functional 


I= ( (y?—u)dt 


hens 


subject to the constraints 
y=u; [ul <1; y(0) =y¥(1) =0. 


Here y and uw are scalar functions. 
The function R(t, y, u) has the form 


R=ogyu—y tuto (1.48) 


(The function @ is a priori minimized by Theorems I.2—I.3, since yo. 4; 
are fixed numbers.) The function @ is defined so as to make R independent 
of u. To this end, it suffices to take 


gy=—l, (I. 49) 


whence 


=—ytC(t), 


where C(t) is any differentiable function. 
Thus, 


R=—y?+C,, (1. 50) 


and this function has a maximum for y=0 (with any wu). 

Setting 97(t)=0 in the equation y=uw, we find u(?) =0. Since y(t) 
satisfies the boundary conditions, the pair y(t)=0, a(t)=0 is the sought 
solution minimizing the functional. Note that we have obtained the 
minimizing solution without solving any boundary-value problems. 

For comparison, let us consider another definition of 9, which leads 
to Bellman's equation. According to this approach, we find the maximum 
of the function R with respect to uw for any ¢ and y and set it identically 
equal to zero: 


supR(t, y, Y=letil—y+9,=0. 


\ul<1 
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’ This is a fairly complicated nonlinear partial differential equation (with 
a nondifferentiable left-hand side). The first technique is thus much 
simpler. 

Example 1.5. Under the same constraints, minimize the functional 


1 
f= | (y?—2°) dt. 
! 


The function R has the form 
R= g,u—y? +u?+q. (I. 51) 


We take g,=0; g=0. The function R, for any ¢ and y , thus has two 
maxima, u=-+1]. Its maximum with respect to y, as before, corresponds 
to y=0. We thus have 


It is readily seen that neither of the two solutions maximizing the function 
R satisfies the differential equation y=u. However, the existence of three 
points at which R has maxima enables us to construct a sequence of 
solutions of the equation y=u such that R goes to a maximum for every f¢, 


R(t, s(t), ts(t)) p(t) =1. 


This sequence is constructed as shown in Figure I. 2. 
As the thrust switching frequency indefintely 
increases (S—oo), wehave y.(f)—-0. Forany S 
we have u> (t)=1, 
y ¥s(t) This sequence provides conditional sliding 
7 solution with zero closeness function y(t) =0 
and basis controls t,2.=+1. 
The above examples have their own charac- 
teristic features, as the previously considered 
0 7 ¢ problems. In each case, the definition of ¢ 
y(t) took into consideration the particular features 
2 of the problem, and thus led to an effective 
solution of the problem by a method entirely 
different from the traditional techniques, 


FIGURE I. 2 
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Chapter I 
ELEMENTS OF THE THEORY 


§ 1.1. STATEMENT OF THE PROBLEM 


Our problem is to minimize the functional 
4 
T= J F(t, y, w)dt+F(yo ys) (1.1) 
bo 


on a set D of the pairs of vector functions y(t), u(t) satisfying the following 
conditions: 

a) The vector function y(t) =[y'(t), ..., y"(t)]is defined over the segment 
[to, 4]; its components y! (f) (i=1, 2, ..., n) are continuous over [fo, ¢,] and have 
a piecewise-continuous derivative; for any fixed t C[fo, 4,], the vector y(t) 
belongs to a given region V,(t) in an n-dimensional vector space. 

b) The vector function u(t) =[u! (t),..., uw” (¢)Jis defined over [fto, t)]; its 
components w/ (t)(j=1, 2, ..., 7) are continuous everywhere in (fp, 4), 
except for a finite number of points, where they may have discontinuities 
of the first kind; for any t€ (fo, 4:), y CVy(t), the vector u(t) belongs to a 
given set Vu(t, y). 

The function F (yo, y,)is continuous for all yo, yi, where 


you y (to), yi=y(h). (1.2) 


The conditions imposed on y(t) and u(t) define a set V(t) of admissible 
values of combinations of n+rnumbers (y!, w/) for every ¢ € (f, t), a region 
V, of the admissible values of (#, y)in the (n+1)-dimensional (4 y) space, and 
a set V of admissible combinations ofn+r+Inumbers (y', u/, t) 

(i=1, 2,..., 2, j=1, 2,..., r). In addition to the above conditions, the pair of 
vector functions y(t), u(t) Should also satisfy the system of n differential 
equations 


y=l(t, y, 4), (1. 3) 
where fa (BP, 9 eas ,/")- 
The functions fi (¢, y, wu), 4=0,1,...,, are defined and continuous for all 
tyu. 
The vector y= (y!, y?,..., y") is generally known as the phase vector or 


the state vector, and its components are the phase coordinates. The 

vector u=(u!, u?,...,u") is known as the control vector, and its components 
are the control functions or controls, Formally the phase coordinates 

are distinguished from the control functions in that the differential equations 
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(1.3) contain derivatives of the phase coordinates and do not contain 
derivatives of the control functions. The sets V,(t) and Vy,(t,) of the 
admissible values of Yo, y; constitute the boundary conditions. 

The argument ¢ may be identified with time, any ascending function, 
a phase coordinate, or some other parameter. 

The problem is formulated as follows: in the set D of the pairs of 
functions y(t), u(t) find a pair g(t), Z(t), which minimizes (maximizes) 
the functional /. 

If the set D does not contain such a pair of functions, the problem is 
formulated in a slightly different form: find a sequence {js(t), us(t)} CD 
such that for S—+oo the functional (1.1) goes over this sequence to its 
lower~bound value on the set D. 

The sought pair of functions #(t), z(t) will be referred to as the optimum 
solution or the absolute minimizing solution, and the sought sequence of 
¥s(t), ws(t) will be called a minimizing (optimizing) sequence. 

We will mainly be concerned with minimizing a given functional. The 
problem of maximizing the functional can always be reduced to the 
minimization problem by a simple reversal of the functional sign. 


§ 1.2. THE SUFFICIENT CONDITIONS OF AN 
ABSOLUTE MINIMUM 


In this section we will prove a theorem which constitutes the basis for 
all the methods of solving the optimization problems described in this book. 
First, however, we have to prove one lemma. 

Let the functional / (v) be defined over some set M, v@M. Let 

inf /(v)=m. 
ee, (1.4) 

Find the absolute minimum of / on M, i.e., find a sequence {vs} CM, 

such that 
lim J (s)=m. 
So s) (1. 5) 

We call this sequence a minimizing sequence and it is said to minimize 
the functional / over the set M. If there exists an element 3 € M such that 


(0) =m, (1. 6) 


we may take vg=? (S~1,2...). In this case the problem reduces to finding 
the element 2 € M for which the functional / attains its absolute minimum 
over the set M. 

Consider a set N ) Mover which a functional L is defined, so that 
L(v) =/(v) for v€ M. 

Lemma. Consider a sequence {%s}( M satisfying the condition 


lim L(v,)=1, (1.7) 


Seo 
f=infL(v) vEN. 


This sequence minimizes the functional / over M: 


where 


(v5) > m, 


Sra 
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Proof. Let (1.7) hold true. By definition, L(#s)=/(vs). We will now 
show that/=m. If this is not so, then from M ( N we have m>! so that 
m—l>e>0. Since {7s} C M, we have /(vs)—l>efor any S. This, however, 
contradicts (1.7). Hence/=m, and (1.5) follows from (1.7). Q.E. D. 

This lemma enables us to replace the problem of minimizing a functional 
over a set M by an analogous problem over a larger set NV. The "augmented" 
problem may prove to be simpler if the structure of the set Nis relatively 


simple. 

Consider a function (t, y) which is continuous for all ¢, y and has 
continuous partial derivatives @,, gy=(@,',-.-, ®y") for allt, y, except a finite 
number of sets {=const in the (¢, y) space. Now construct the functions 

Dy, WI=F(Yo W+P(f ¥1)—F(to. Yo); (1.8) 
Rit, Yy, u)= gy f (t, y, u)— fr, Y, uy+ oi; (1.9) 
p(t)=sup R(t, y, 4). (1.10) 
(yu)E V(t) 


Here q, is ann-dimensional vector function, and the first term on 
the right in (1.9) is a scalar product of n-dimensional vectors: 


n 
wl=> pyi ft. 
i=) 


The vectors y and u on the right in (1.10) are assumed to be independent. 

Theorem 1.1. Consider a sequence f{ys(t), us(t)} C D. For this 
sequence to minimize the functional / over the set D, it is sufficient that 
there exists a function @(t, y) such that 


1) Rit, ys(t), ts(Ol— w(t), £€ (fo. 44)" (1.11) 
2) ®(yos, Yrs) inf ® (yo, ¥1)> —00. (1.12) 
Soe YoEV y (fo). WriEV y (Er): 


3) u(t) is piecewise-continuous over [fo, 4]; 
4) there exists a finite number Q such that for any S<oo and any ¢€ (fo, f;) 


R(t, Fs(t), Us(t)) >Q. 


Remark. If the absolute minimizing solution (y(t), z7(t)) € Dexists, 
Theorem (1.1) takes the following form. Consider the pair ((t), 7(t)) € D. 
A sufficient condition for the functional (1.1) to attain an absolute minimum 
on this pair is the existence of a function @(¢, y), such that 


1) RUE, yd), 2 (=e) (1.11%) 
almost everywhere in (fo, £,); 
2) 2 (y, y= inf ® (yo, 41)- (1.12%) 


HEV y(to), Yi€V yltr) 

Conditions 3 and 4 in this case are satisfied automatically. 

Proof. Here the set M of the previous lemma is identified with the 
set D of the pairs of vector functions y(t), u(t). The set Nis defined as the 
set E of the pairs of vector functions y(t), u(t) which differs from Din the 
following two respects: first, the functions y! (tf), (i=1, 2,...,1) may have 
discontinuities of the first kind at a finite number of points of the segment 
[to, 4] and, second, the vector functions y(t), u(f)are not related by 
differential equations (1.3). We define the following functional on E: 


* Convergence in the measure is implied here: the measure of the set of pointst € (to, ¢,), where u(t)—Rs>e, 
goes to zero for any given €. 
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t, 
L(y (2), 4())=® (Yo, 41)— J R(t, y(t), a(é)de. (1.13) 


On D we have L=/. Indeed, using (1.3) and (1.9), we get 


R=9—f?, (y(t), u(t) €D, (1.14) 
where 9 is the total derivative of the function 9 in virtue of (1.3). 

Inserting (1.14) in (1.13) and remembering that y(/) are continuous 
functions for u(?))¢D, we obtain L=/, 

Suppose that there exists a function g(t, y) and a sequence {s(t}, ws(t)}€D 
satisfying conditions 1 through 4 of the theorem. We will show that in this 
case {¥s(t),Zs(t)} is a minimizing sequence of the functional L over the set E, 
ice., 


lim L(ys(e), #5 ())—1 = 0, 


where 
tint L(y(), u (t)). (1.15) 


Since the functions y(t) which belong to the set E may have discontinuities, 
the first and the second terms in expression (1.13) for L are independent. 
Therefore 


t 
l= inf ® (Yo, y,)—su t, y(t), u(2)de. (1.16) 
cane ntti evyean? eo 4) sup FRU ¥iO, HH) 


We have 


lim L (ys (t), its (t))— f=lim ® (Yos, Ys)— 


S+0 S+e 
t, 
—inf ® (yy, y)+sup | R(t, y(d), a) at— 
(Yoo VIE y(ta)tV y(t) EF ZF, 
ty = 
—lim f R (ys (t), ts (0) at. (1.17) 


S+0 to 


Using (1.11), (1.12), and Lebesgue's theorem on the limit of an integral, 
we write 


is 2 t 
lim L (ys (4), 45 (t))—l=sup ) R(t, y (A), a(é))at— 
S-+ 0 E 


t a = 
—lim fre ys(t), Us (t)) dt= 
at 


t, ty = = 
sip JR yO, ayat— f (limR(ys (0, as () dt = 
to , S00 
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fy 


= sup j Rit, y(t), a())dt— feat. (1,18) 


to 


Suppose that our proposition is not true, i.e., the given sequence does 
not minimize the function Z over E. Then there exists a number ¢>0 such 


that 
lim L (ys (2), 4s (4) —L>e, 


Peo 


and by (1.18) 
t, 


swe { RC y(t), a(d)d He) ede (1.19) 


According to the definition of the exact upper bound, there exists a 
sequence {y,(f), u,(¢)} ( £ such that for a sufficiently large & 


ty t, 
sup { RU 9, w(D)dt— (RW we, O)AE< >. (1.20) 
to to 
Subtracting inequality (1.20) from (1.19), we find 


t, t, 
JR wl, al dt—[eOat> 5, 
fo fy 


or 


t, 
{IRE Ww HO)— sup RE y, Wlde> + D0. (1.21) 
(y.we V(t) 2 


to 


The integrand is non-positive almost everywhere in (fo, 1), so that the 
integral is also non-positive, and inequality (1.19) breaks down. 

The sequence {js(f), iis(t)} thus indeed minimizes the functional L over E. 

In virtue of our lemma, this sequence also minimizes the functional / 
over D, Q.E.D. 


§1.3. PROBLEMS WITH A FREE BOUNDARY 
So far we treated [f, ¢;] as a segment with fixed end points. In some 
problems, however, ?¢, is not fixed and may be chosen from considerations 


of optimality. 
We again consider the minimum of the functional 


t, 
[= P(t, y, a) dt+F(Yo. Yn A). (1.22) 
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Here ¢, is an element of the set tT of points of the ¢ axis contained in the 
segment [to, T], T<oo. The vector functions y(t), u(t) satisfy all the 
conditions listed in §1.1. 

The sets V,(ft) and V,(¢) are assumed to be defined over the segment [to, 7]. 
The function F(t, yo, y1)is defined and continuous for ft, € t, Yo € Vy(to), 

y, € V,(t:). Each element of the set D is now a combination of a number 
t,€ tand the vector functions y(t) and u(t)defined over the segment [to, ti]. 

The analog of Theorem 1.1 in this case is formulated as follows. 

Theorem 1.2. Consider a sequence {fjs, ¥s(t), ¥s(t)} C D. For this 
sequence to minimize the functional / over D, it is sufficient that there 
exists a function p(t, y) (see § 1.2) such that 


1) R(t, ys (2), Us (4) > B (A) (1.23) 
on (4, tis); 
2) ® (As. Yos. Yis)> inf ® (t1, Yo. 1) > — 9; (1.24) 


HEHE VY (fo) MEV YE) 
3) p(t) =0 on (bo, 7); 
4) there exists a finite number Q such that for any S<oo and any 
t€ (to, tis): 
R(t, ¥s(t), Bs(t)) >Q, 


where R(t, y, u) and p(t) are defined by (1.9), (1.10). 

Proof. We define a set E of the triads (t,, y(t), u(t)) which differs 
from Din two respects: first, the functions y' (t) (i=I, 2,..., 2) may have 
discontinuities of the first kind at a finite number of points of the segment 
[t, T] and, second, the vector functions y(f),u(t) are no longer related by 
differential equations (1.3). On E we define the functional 


L(t, yO, 4) = ® (ty vo. wf RU y, wae. (1.25) 


Clearly L=/ for (t,, y(t), u(t)) €D. We will now show that conditions 1 
through 4 of the theorem are sufficient for the sequence fi1s, ¥s(t), ws(t)} to 
minimize the functional L over E. In all other respects, the proof coincides 
with that to Theorem 1.1. 

Let 

R(t)= inf Li, y(é), # (2), (1.26) 
(y(t), a(E))€ E(t) 
where E(t,) is the section of the set £ for a fixed t,, i.e., the set of elements 
(41, y(t), u(t)]€ B for a fixed ft, 


$(t)= inf Dt, Yos Yr) (1.27) 
(Yo. WE V y(to) +V y(t) 
m= inf p(t). (1.28) 
GErt 


Using (1.26), (1.27), (1.28) and condition 3 of the theorem, we may 
write 
f=inf L(t, y(t), 2(f))=ink &(4,) = inf o(4;) =m. (1.29) 
E het tGt 
On the other hand, the limit of the functional L over any sequence 
{tis, ys(t), us(t)} CA satisfying the conditions of the theorem is also equal 
tom: 
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Hint (se ys(t), 4s())=lim {® (445, Yos, Yis)— 
> 00 S+ 0 


t 
=("R (é, Ys (t), Us (at =m ® (4s, Yoss 415) as 
ty => OO 
—lim {rR , ys(), us(t), ws (2) de. (1.30) 


Since the set tis bounded, we see, using the properties of the sequence 
R(t, ys(t), us(t)), that the second term vanishes. Therefore, using the last 
equality together with (1.24), (1.27), (1.28), we find 


lim L (4s, ys(¢), 45())=m=L, 


S00 


i.e., the sequence {t1s, s(t), us(t) } indeed minimizes the functional L on E. 
The rest of the proof is conducted as for Theorem 1.1. 
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Chapter II 


SOME METHODS OF SOLUTION OF VARIATIONAL 
PROBLEMS USING THE SUFFICIENT CONDITIONS 
OF THE ABSOLUTE MINIMUM 


In this chapter we consider some methods of solution of variational 
problems based on Theorems 1.1 and 1.2; these are methods which show 
how to choose the function ¢(¢, y) so as to reduce the treatment to standard 
classical methods — Lagrange's method and the Hamilton-Jacobi method — 
modifying them to such an extent that a complete solution of the problem 
is obtained, i.e., the absolute minimizing solution is found. 

All the results are derived for the case when the minimizing solution 
(y(t), a(t)) / D exists. A more general case is treated in the next chapter. 


§2.1. THE LAGRANGE-PONTRYAGIN METHOD 


This method enjoys the widest applicability, although it is not always 
the simplest. The underlying idea is to find the partial derivatives of 
o(t, y) at the points of the minimizing solution 


wilt) =O lt F()] fS1, 2... 0, (2.1) 


and also the minimizing solution (7(¢), w(t)) itself, while the proof of the 
existence of the function g(¢, y) satisfying the conditions of Theorem 1.1 is 
postponed to a later stage. 

For simplicity, let t; be fixed, let Vy(¢) be an open region for allf€ (to, 4), 
which for t=f) and /=/, reduces to the points 


Y (fo) = os ol) = i: (2.2) 


the set V, is independent of y. In what follows, the functional / is conven- 
iently replaced by a functional to/, where wo is a positive constant. This 
substitution evidently does not affect the essential features of the problem. 
We moreover assume that the functions fi (ft, y, n), ¢=1, 2,..., 2, are 
continuous and differentiable for all f,y,u. If we further demand that 
p(t, y) be twice continuously differentiable at the points corresponding to 
the presumably minimizing solution y(t), we can write the necessary 
conditions of a maximum of R in the form 
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Re (ty, A)=ewrf (ty, ey Sir(t, y, 4)— 
—Yohur(t, 4, 4)+ eyt=0; 
k=1, 2,..., 2 (2.3) 
gut (ty, t)—b I(t, y, 2) = 
=suple f(t, y, u)— f(t, y, 4)). 


4eV, 


Using (2.1) and the equality f(t, 7, z)=y, (g(#), a@(t))€D, we obtain 


Re lt o(), C= 2 ty + Hy 0; (2.4) 
A(t, ¥(), y, wl=supA[t, y(A), 2], (2.5) 
ueV 
where y= Ey) _ is the gradient of the function 9(f, y) at a point 
yo lye) 
y=y(t)in the space Y, 
Alt, 4, W=P(NF (Ly, U)—HfP(t, y, H). (2.6) 


Conditions (2.4) and (2.5), together with (1.3), constitute 21+r equations 
in 2n+r unknown functions 


yi (t), p(t), w(t), f= 1, 2,..., 2 9=1,2,..., 7. 


Together with boundary conditions (2.2), these equations define an 
extremal, i.e., a pair (7(t), w@(t))€ D which satisfies the necessary 
conditions (2.3) for the supremum of R(/, y, u), and a vector function y(f) or, 
by (2.6), the gradient of the function ¢(/, y) at the points of the extremal. 

Conditions (1.3) and (2.4) constitute a system of ordinary differential 
equations for the functions 7'(f). w(t), i=1. 2,..., 2, closed by the finite 
relation (2.5), according to which for u=ua(/) the function HA[f, 7(t), p(t), 4) 
reaches its largest value compared to its values for all the admissible 
control functions for every fixed {€ (fo, f:). Equations (1.3), (2,4) may be 
written in the form 


-__ OH 
i= 5 (2.7) 

: 0H 
s=—-—, 2.8 
ay (2.8) 


These equations constitute a so-called Hamiltonian system and 
the function H(t, y, p, u) is known as the Hamilton function or the 
Hamiltonian. 

The variables y! and p; and the respective systems (2.7) and (2.8) are 
said to be conjugate. 

Equations (2.4), (2.5) prove to be not only the necessary conditions for 
a maximum of the function R, but also the necessary conditions for a 
minimum of the functional, provided the strict inequality po>Ois replaced 
by the weaker condition po>0. This is so because, after the inequality 
adjustment, these equations coincide with the necessary conditions for the 
minimum of the functional corresponding to Pontryagin's maximum 
principle. 
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At all points of the space TXY which are not points of the extremal, the 
function g(/, y) may be defined quite arbitrarily. If there exists a function 
o(t, y) such that at the points of the extremal y(f), w(f) both the necessary 
and the sufficient conditions for the maximum of R(f, y, uw) are satisfied for 
every fixed ¢ C(to, f:), Theorem 1,1 indicates that the extremal (y(t), @(t)) 
is in fact the absolute minimizing solution. This result can be summarized 
by the following theorem. 

Theorem 2.1. Let the functions y(t). @(t), p(t) constitute a solution of 
equations (1.3), (2.4), (2.5). A sufficient condition for the extremal 
(g(t), @(f))to be an absolute minimizing solution for the functional (1.1) is the 
existence of a function g(t, y) which is continuously defined for all 
t€ [to, hl, yE Vy(t) and is twice differentiable with respect to y for all 
y€ V,(t), t€ [to, ti] and piecewise-differentiable with respect to f, such that 

1) etl y=), = 1, 2,..65 

2) Rit, y, w= sup Rit, y, 4) FE (to 41) 

(y. WG Vit) 

This theorem gives a sufficient condition of the absolute minimum which, 
unlike Weierstrass's sufficient condition of variational calculus and its 
generalizations /2/, does not require the construction of the field of 
extremals nor the derivation of any other extremals except the one being 
considered y(t), w(t). 

The algorithm of the method reduces to solving a boundary-value 
problem for the system of ordinary differential equations (1.3), (2.4), (2.5) 
and choosing a function g(t, y) which satisfies the conditions of Theorem 2.1 
or, more precisely, proving the existence of such a function. Thisisa 
highly significant qualification: it shows that in Lagrange's method we do 
not have to determine the particular form of the function g(t, y) satisfying 
the conditions of Theorem 2.1, and it is sufficient to prove the existence of 
this function only. 

In particular, we may start with the expression 


p(t, y) = Will) y! +oij(t) Ay! Ay’ , (2.9) 
4j=1,2.....4, 
where 
Ay! =y'—g (1) 
and the functions 
(=F It, ¥ (OI (2.10) 


are continuous and piecewise-differentiable over [to, 1, Theorem 2.1 then 
leads to Corollary 2.1. 

Corollary 2.1. For the extremal j(t), a(t) to be the absolute 
minimizing solution of the functional (1.1), it is sufficient that there exist 
n? functions o:,(¢) such that 


Rit, o2), of), y, 4J=p(d), (2.11) 
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where 
R=(b +57 dy) Fi, y, U)— SOL, ys He) + 


+ 4 (Oy'+ Shen Oaytay). (2.12) 


Example 2.1. Let us apply the above method to investigate the 
functional 


~ 


/= | (a? — y?)adl, 


b (2.13) 
yea; y(O)= y(t) -0. 
First let us find an extremal. We have 
f= w—u?t i. 
Equations (2.7), (2.8) are written in the form 
p= — 2y; y= u: 
Huge -- 2<0. 
If (;542m (m=1, 2,...) the unique solution of this system satisfying the 
boundary conditions is 
y(t) =u(t) = y(t) =0, t = (bo, fh). (2,14) 
Let 
p(t. y) =o(t)y®: Py= 2 oy. 
Then 
R(t, y, w= qyi—wteyr +g (1a) y? + 2oyu—u?, (2.15) 


For the extremal (2.14) to be an absolute minimizing solution, it is 
sufficient that there exists a continuous function a(t) such that the quadratic 
form: R(t,y.u) be positive definite for every (€(0, 4). The necessary and 
sufficient condition of this is 

~ Rig = 220: RyyRag— Rint» — 41 +52 +9) >0. 

The first inequality is satisfied identically, whereas the second imposes 

a constraint on o(f): 


I+o2+0<0; t€ (0, 4). (2.16) 


Thus, for the functional (2.13) to attain an absolute minimum on the 
extremal (2.14), it is sufficient that there exists a function a(t) satisfying 
(2.16). For example, let = 1/3. Let a(/)=--2/. It is readily seen that 
(2.16) is satisfied everywhere on the segment [0, 1/3], so that on this 
segment (2.14) is an absolute minimizing solution. 
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Let us find the maximal value of ¢; for which the extremal (2.14) satis- 
fies (2.16), i.e., let us find a continuous function o(¢) which satisfies (2.16) 
over the maximal interval (0, ¢#;). This function is obtained by solving the 

uation 
eer. 1+o0?+0=0, 
and choosing a solution which is continuous over the maximal interval 
(0, 41). The general solution of this equation is 


a(t) =—tan(t+c). 


A particular solution which is continuous over the maximal interval is 
o= — tan(t++) . 


The continuity interval of this solution is (0, m). Thus for any ¢;<m, the 
extremal (2.14) is the absolute minimizing solution. As we know /7/, the 
point ‘=n is the conjugate of the point (0, 0) and by Jacobi's necessary 
condition there is no minimum on the extremal (2.14) for &>n. 

Our sufficient conditions in this case thus coincide with the necessary 
conditions "apart from the point f;=2". 


2.1.1. A strong local minimum 


We say that the functional / has a strong local minimum on the extremal 
y(t), w(t) if there exists e>0 such that 


1g (t), (DIST [y(d), a(d)] (2.17) 


for all the pairs (y(t), u(t))E D, |y(t) —9(1)|<e, everywhere on [lp, 4]. 
Let f‘ and f° betwice differentiable functions for any ¢€[fp, #,]; 
(u, y) CV(t) and the set Vx(f) is an open bounded region. 
We have the following sufficient conditions for a strong local minimum. 
Theorem 2.2. Let the vector functions g(f), a(t), w(t) constitute a 
solution of system (1.3), (2.4), (2.5). Among these functions, u(t)is a 
strict maximum of H(t, u) everywhere on [fo, fi], i.e., 


Hit, u)< A(t, vw), ue Vu (1), ua, (2.18) 
and the matrix ||—H 


asatll S> t=1, 2,..., ris positive definite. 

Then a sufficient condition for the functional / to have a strong local 
minimum on the extremal (g(t), w7(‘)) is the existence of n? continuous 
piecewise-differentiable functions o,;(f) which on [fo, ¢,] satisfy n differential 
inequalities 


Arif, o(£), 6(2)]>0, j=1, 2,..., 0. (2.19) 
Here 
—Ruw soe eee Ruut Rey ses = Ruryi 
Api Raw . oo — Rata — Rury eee — Raryd (2,20) 
—Ryw—Ryu. o- —Ryur —Ryy we 2 SRyy) 
—~Rylur— Ryiar » Shee dh es Re ee a —Rylys 
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Rusut= Hasut; 


Resyl= oufart Hast : (2.21) 
Ry ty i= Si pf yh + Sieh yi + Ay iyh; 
i, jf, R=1, 2,..., m5 8, = 1, 2,..., 7. 


(summation over & is implied). 

The bar denotes the values of the corresponding functions for y=j(t), 
u= u(t). 

Remark 2.1. The conditions of the theorem incorporate the conditions 
of negative definiteness of the quadratic form 


@R |t, y, U=Ryiyidy'dy’ + QRytasay' ae + 
+ Rusut wan (2.22) 


for any fixed ¢€[t, f)] (summation over repeating indices is implied). 
Proof. The function g(t, y) is chosen in the form (2.9). Then 
R(t, y, u) takes the form (2.12), where the matrix o;,(f) satisfies inequalities 
(2.19). 
By (2.1), there exists e:>0 such that 


R(t, y. u)<R(t, H(t), BD) (2.23) 


for |y—y|<e,, |u—u|<e, everywhere on (fy, 1). 
Now, by (2.18), we have 


R(t, y, u) <R(t, J, @), uCVult), uA, 
i.e., for alluCV.(¢), with the exception of @. 
Since R(t, y, u) is continuous, for every fixed ft there exists e2>0 such 
that 


Rit, ¥, D<RUE y, 2) (2.24) 


for ~ 
ly~ yl <e; ZEV, (8); |u — al < eq: LE (ty, th). 


Let € be the smaller of the two numbers & and &€2. Then, by (2.24), 


R(t, y, u) <R(t, 9, 2); 


ly—-F|<e, wCVu(t), wD, EC (to, 4) (2:29) 
and it follows from Theorem 1.1 that the functional / has a strong local 
minimum. 

Remark 2.2. Theorem 2.2 is formulated assuming an open set 
V.. If Vu is a closed region, we have the following analogous 
theorem. 

Theorem 2.3, Let the vector functions j(f), w(t), p(t) constitute a 
solution of systems (1.3), (2.4), (2.5). Here u(t) is a strict maximum of 
H(t, 7, p, u). Then a sufficient condition for a strong local maximum on 
the extremal (g(t), @(f)) is the existence of n? continuous piecewise- 
differentiable functions o;;(¢) and a number e>0 such that for all nonzero 
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jAy'|<e, lAu|<e 


(2.26) 
j, i=1, 2,..., 2, S=1, 2,..., 75 EV, CE (fo, 4) 
we have the inequality 
F PREG w+ ARE, Y, W=RAY+R CL YW) Ou + 
(2.27) 


of > (Rysut Ausaut + 2Rusy! AusAy! + RytysAy'Ay)) <0. 


Here Ry, R R,, are the coefficients defined by (2,21), 


aSy Nysy 
Ay=y—Jy; 
Au=u—ui. 

Suppose that a solution oj;(¢) of the system of differential inequalities 
(2.19) has been found and the existence of a strong local minimum on the 
extremal (y¥(/), w(f))has thus been proved. By Corollary 2.1, the pair 
(7(t), u(t)) is the sought solution if the function R(t, y, u) defined by (2.12) 
has an absolute, as well as a local, maximum on V(f)at the point (7(¢), a(2)) 
for any ¢€ (fo, fi). 

Note that the proof of the existence of an absolute minimum on the 
extremal (y(t), w(¢)) is often conducted in a straightforward manner, without 


first proving the existence of a local minimum. This, in particular, is 
the approach used in the following two problems. 


2.1.2. Systems linear in the phase coordinates 


Let the right-hand sides of equations (1.3) and the integrand in (1.1) 
have the form 


fi=ah(t)y tart, uv): i, f=1, 2..., 05 (2.28) 
fo= aj(t) y+ ACE, 4). (2.29) 
The boundary conditions are fixed: 


Yor Yot, Yi=Yit- 
Equations (2.4) and (2.5) in this case take the form 


by + ay, — a} =0; (2,30) 
H*(t, uw) = (t)a'(t, Z)—AI(L, w= sup H*(E, xu). (2.31) 
weV,(t) 


Suppose that (g(¢), z(t), p(¢)) is a solution of (2.28), (2.30), (2.31). We 
will show that the extremal (y(t), z(¢t))is an absolute minimizing solution 
of the functional over the set D. 
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Let 


o(t, y) =Hi(t)y’, i=1, 2,..., 2. (2.32) 
Then 
RE, y, W=(HA5+h—a)y +A (t, =H", 2). (2.33) 
Since R(t, y, u)is independent of y, using (2.31) we find for all ¢€ (to, #1) 
R(t, y, %) =supR(t, y, u)=p(t). (2.34) 
(Hwy eV Ut) 


Thus by Theorem 1.1 the pair (7(t), z(¢)) is the absolute minimizing 
solution. 


2.1.3. The problem of the minimal mean error 
Let the right-hand sides of equations (1.3) have the form 


fi =a) (t)y ous, : 

i, j=1, 2,..., 2; s=1,2,..., 7, (2289) 
and the function f°(¢, y, u)is non-negative and convex on V(£), ¢¢ (to, 41). This 
is generally the final form to which the problem of optimal control of the 
perturbed state of a system near some given state y(t) =0 is reduced. 
Equations (2.35) are thus the equations of the perturbed state, and the 
function (1.1) is a measure of deviation of the system from the given state. 

Equations (2.4), (2.5) defining the extremal have the form 


bj-+ ayy, — fyi =0; (2.36) 
H(t, y(), 9), w= sup H(t, (2), 9(2), 2). (2.37) 
“ae Vio 


Suppose that (g(t), u(t), 4(t)) is a solution of system (2.35)—(2.37). We 
will show that the extremal (j(¢), u(¢))is an absolute minimizing solution. 
We take q(t, y) in the form (2.32). Then 


R(t y W= (Hab y’ +4 Obs — ft, y, w). 
The function R(t, y, u)for every fixed fis convex and it thus has a single 
supremum y, u onV(t). Therefore, by Theorem 1.1, the pair(y(t), @(t))is 
an absolute minimizing solution. 


2.1.4. Jacobi's necessary and sufficient condition 
of the variational calculus 


Using the particular case of a very simple functional, we will establish 


a relationship between the conditions of a maximum of the function R and 
Jacobi's variational condition of a weak local minimum of a functional. 
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Consider the problem of minimizing the functional 


T= { fr, y, u) at; (2.38) 
to 
y=2, 
Yo, Yy are given, 


where y(t), u(f) are scalar functions, f(t, y, u) is a continuous function 
with continuous partial derivatives to third order inclusive. 

Let V(t) with ¢ € (to, t,) be an open domain. 

Let the functions ¢(¢, y) have continuous partial derivatives with respect 
to ¢ and y to third order inclusive. 

The function R is written in the form 


R(t, y, u) =pyu—P(t, y, u) +i (lt, y). (2.39) 


For sup R with respect to y and u to be attained on the pair (y(/), Z(t)) ED, 
it is necessary that 


a ne ee 
Rt Ay iT * as 
R, = H,=9—-fi=0. 


Note that conditions (2.40), together with equation (2.38), are equivalent 
to Euler's equation (which may be obtained by eliminating tp and u between 
the appropriate equations). 

The next necessary condition of a maximum of RF is that the quadratic 
form 


@R=RyyAy? + 2RywAyAut RuyAw (2.41) 


be non-positive; here bars denote the values of the second-order derivatives 
corresponding to the pair (#(¢), w(t) ). 
Setting o(f) =Qyy(4 ¥(t)), we obtain 


Ran= — Sin (f, y, u)=—Fini (2.42) 
Ry, =3— fu: (2.43) 
Ry =3— Suv (2.44) 


Let y(f)be a continuous function. Then o(t)is also continuous. 
A quadratic form is non-positive if and only if the diagonal minors of the 
matrix 


—Ry —Ry, 
Ry Rua 
are non-negative /4/. Using (2.42), we can thus write 
1) Faw > 0; (2.45) 
2) —fau(s—Fiv)—(¢—Fiay > 0. (2.46) 
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The first inequality in (2.45) is Legendre's well-known condition. Let 
us consider in some detail condition (2.46): this is a differential inequality 
involving the function o(¢). Let fi >0 everywhere in[fy, t,]. Note that since 


o(t) is quite arbitrary, condition (2.46) can always be satisfied on a 
sufficiently small interval (fo, t:) by an appropriate choice of the initial 
conditions o(fo) ando(fo). To check whether or not (2.46) is satisfied on the 
entire interval (fo, 41), we write it in the form of a Riccati equation for the 
function o(f): 


fi CH= f,)—b— fy =r RK (2.47) 


where e(/) 20 is some continuous function of ¢. We know from the theory 
of differential equations that the singular points of the solution of (2.47) 
(the points where a(t) does not exist) coincide with the zeros of the non- 
trivial solution of the second-order homogeneous linear equation 


d — 4 a — _ 
gy Fat) +| 4 Few Fav +e] 0=0, (2.48) 
which is obtained from (2.47) by a transformation 
o— Fu Faw: (2.49) 
For e~0 equation (2.48) reduces to Jacobi's equation /2/ 
d —, . — dc 
4 (Faw) ~ (Fu — Fu) 0= 0. (2.50) 


Equation (2.48) will be investigated using Sturm's theorem and the 
theorem of the alternating zeros of solutions of second-order linear 
homogeneous equations. The theorem of the alternating zeros indicates 
that the maximal interval (fo, ¢;) on which the solution v(f) of equation (2.48) 
does not vanish for a given ¢(t) corresponds to the solution which satisfies 
the initial condition u(f))=0; in other words, it corresponds to a solution 
with one of the zeros coinciding with the left-hand end point of (éo, ¢:). Let 
this solution be designated v(t, fo). Sturm's theorem leads to the following 
proposition. 

Let v(t) =0 for equation (2.48) for any e(f)>0. Then among all the 
equations (2.48) with any e>0, the maximal interval (fo, f) on which the 
solution u(¢, f9)does not vanish corresponds to Jacobi's equation. 

In other words, under the above donditions, for e(t)>0, the next zero 
of the solution of (2.48) on the right of 4) is no farther from fp than the 
corresponding zero of the solution of Jacobi's equation, generally called 
the conjugate point of t=. 


Thus, if Top >0 and there exists a continuous and differentiable function 
o(4) such that condition (2.46) is satisfied on (é, t,), the interval (to, 4) 
contains no points conjugate to 0, this being Jacobi's necessary condition 
for a minimum of the functional (2.38). 

Conversely, the comparison theorem shows that if (fo, 41) does not 
contain points conjugate tof, there exists a function e(f) >0such that the 
solution u(t, fo) of equation (2.48) does not vanish anywhere in (to, fy). 

If ¢,; is the conjugate of 4, we have e(t)=0. Then there exists a 
corresponding solution of Riccati's equation or, in other words, a function 
o(t)such that (2.46) is satisfied. 
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Let us consider the conditions of negative definiteness of the form (2.41), 


namely that everywhere in[to, 4], Tha >0, and on (fo, t;) there exists a function 
a(t) such that the strengthened condition (2.46) (the strict inequality) is 
satisfied or, equivalently, e(t)>0 on (fp, t1) in (2.47) and (2.48). The form 
(2.41) is then negative definite. We will show that when these conditions 
are satisfied, the segment [fo, t;] does not contain points which are conjugate 
to to. 

Suppose that this is not so, i.e., ¢, is the conjugate of ¢ (the interval 
(to, f), as we have seen before, contains no conjugate points) and u(t, to) 
vanishes at 4. Then for any e(t)>0 the nearest zero of the solution of 
(2.48) is nearer than t,, lying inside the interval (fo, 1). This signifies 
that the quadratic form d?R cannot be negative definite, at variance with 
the conditions of the theorem. 


Conversely, let Faics0 on [to, ¢,] and let [fo, f;] contain no points which are 
conjugate to tf). Since the solution of Jacobi's equation for u(t) =0 does 
not vanish on the half-open interval [t, t)], there exists 6>0 such that the 
solution of equation (2.50) does not vanish on (é, ¢; +6) either. 

Because of the continuous dependence of the solutions of linear 
differential equations on parameters, there exists a constant e>0 such that 
the solution of equation (2.48) does not vanish on [f, f;] either. Hence it 
follows that a function o(f) satisfying the strengthened condition (2.46) 
exists on (fo, ;). 

We have thus proved the following theorem. 

Theorem 2.4. Considerapair (y(t), u(t)) € D, where u(t) is continuous 


and 7°, >0 everywhere in [fo, tJ]. A necessary and sufficient condition for 
the existence of a continuous and continuously differentiable function a(t) 
such that for any fixed (€ (fo, f,) the quadratic form 


@R=RyyAw+2RyyAyAut RyyAy? 


is negative definite is that for the pair y(t), w(t) the interval (fo, 41) (or the 
segment [fo, f:]) contains no points conjugate to f. 
Thus for a nondegenerate (f°, #0) classical Lagrange's problem, the 


ua 


condition of a non-positive quadratic form 


pasa (2.51) 
t€ (fo, f) 


is equivalent to Jacobi's necessary condition of a weak local minimum, and 
the strengthened condition (2.51) is equivalent to Jacobi's strengthened 
condition. However, the region of application of condition (2.51) or of its 
extended version (2.27) is much wider than of Jacobi's condition: they are 
applicable to degenerate problems (when the determinant vanishes, 

|7 114; (4 9, 4) |=0 ) and to problems with a closed region V,, where 


Jacobi's condition does not apply. Note that it is for these problems that 

the analysis of the second-order conditions is especially important. They 
are needed not only for checking purposes, but are actually used 
constructively, since they provide a means for isolating the local minimizing 
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solution from the set of all extremals (in these cases, as distinct from the 
case of nondegenerate problems, the extremals are not unique). 

Example 2.2. As an example of such a problem, let us minimize 
the functional 


+1 
/=— \ ty’dt; 
| 


: (2.52) 
y=u;, [ul <1; 
y(—1)=y(1I)=0. 
FIGURE 2.1 
Conditions (2.4), (2.5) take the form 
p= — Qty: (2.53) 
H= w+ tP=supH (t, y, 2). (2.54) 
|u]<1 
The last condition corresponds to three types of control: 
+1 for $>0 
u= —1 for <0 (2,55) 


arbitrary within the limits [u|<1 for =O. 


It is readily seen that conditions (2.52), (2.53), (2.55) are satisfied by an 
infinite set of extremals (Figure 2.1), which differ from one another in the 
number of control switching points and the number of isolated zeros of the 
function p(t). The initial segment of each extremal corresponds to the 
third control type in (2.55). Indeed, differentiating the identity p(/)=0 
and inserting (2.53), we find y(t) =0. 

To select the absolute minimizing solution from among these extremals, 
a more detailed investigation of the maximum of R(é, y, u) is required. 
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We have 
R=oy(t, yutty?+or(t y). (2.56) 


FIGURE 2,2 


Here (4, y) is regarded as a thrice differentiable function. 

We seek a solution of the problem satisfying the condition of the 
fundamental theorem. Such a solution g(t), u(t) should satisfy the con- 
dition 

R(LY, W)BR(E y, 4) (2.57) 


for all (y, u)EG. 

Here G is a horizontal strip on the y, u plane between the straight lines 
u=+1 (Figure 2.2), For (2.57) to be satisfied on G, it is necessary in 
particular for (2.57) to be satisfied on a thin strip G, enclosing the segment 
[-1, + 1] of the vertical line y=y inthe (y, u)plane. For those ¢ when 
p%0, we have u=+1. The necessary condition for this is 


Row = Pu (Ey yet Py + 2t| gegen <9, 


unu(t) 


or, seeing that y=, and writing 


o(/)— gy lt, ¥ (A). 
we obtain 


Ry (4, % a) =ot2<0. (2.58) 


In G,. we need only retain the first terms in the expansion of ~ and R 
in powers of Ay=y—y, and we may thus write 


a(t) 


et, N=, YO) 19M dy + Ay? + 0(AN%): 
R=(ptoay) wt (yt Ay? +4 (ay) + 
+ EAP +86 FO HO +O ANY. 
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We see from the last formula that for p=0 the function R(y, uw) has a 
supremum on G, at the points (y,1) and (y,—1) for o=Oonly. Thus, for 
condition (2.57) to be satisfied for ¢€(—I, +1), it is necessary that there 
exists a continuous function o(t) satisfying (2.58) which vanishes for those 
t when p=0, i.e., at the switching points. It follows from (2.58) 
that for t>0, o increases and consequently there may exist only one f=v, 
for which o=0, i.e., (2.57) is satisfied only by an extremal containing at 
most one switching point with a positive abscissa. The only such 
extremal is extremal 2 in Figure 2.1. 

Thus, the necessary condition for the existence of o(t) for which R has 
a maximum in y in a small neighborhood selects a single extremal — 
extremal 2 — from among an infinite number of extremals. This extremal 
is readily shown to satisfy the conditions of Theorem 1.1. Let @ be given 


in the form a(t) = 
et, =H Out > (y—yOP, (2.59) 


where a(t) is a continuous function satisfying inequality (2.58) and the 
condition o(¢§?))=0, o(t)=0 for t<t!?’, Then R is a quadratic form and 
satisfies (2.57) for any t, whence it follows that extremal 2 is an absolute 
minimizing solution. 


2.1.5. General boundary conditions 


1, Let the sets V,(to) and Vy(ti) be some general sets in the space Y, 
and not the points y) and y,; as assumed before; ¢,; may take values froma 
given bounded set t. The function F(t,, y;, yo) entering (1.1) is continuous 
and differentiable. In this case, additional conditions (1.24) of Theorems 
2.1—2.4 are imposed on the sought minimizing solution y(t), w(t) and the 
function p(t, y), which demand that the function M(yo, y;, f)) have a minimum 
for Yo=Jo. =, f= and that w(/)=0. 

A necessary condition for a minimum of @M(t, y1, yo) at the point 
t1, Yo. 7; is provided by the inequality 


hy nyt A SO (2.60) 
1 


for all 
LET, yoCVy(to), yr EVy(h), 
where AYo= Yo— Yo: Ay;=9, hh: Ah=t4,—4, are all sufficiently small (the 


bar denotes the values of the derivatives for yi=¥i, i=). 
Using (1.8), we write (2.60) in a more explicit form: 


[= (fo, 70 | AYo {e+ ey (4, a] Ayt 
YO Yt 


OF 7 
os | 4 (A; in| At, > 0. 
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Using the notation from (2.1) and (2.6), and seeing that in virtue of the 
identity p(t)=0 


de (ty (t)) ro _ Ov(t, y(t) 
f Fo ft 


y, a) = 
ot Oy y ) 


=—Hit, y, 0, 4(6), 


we finally obtain 
[= -9 a) Ayo + [= +4 | ay + 
Yo iy) 


oF Pere, 
+| 2-H (4, 7, Wh) 04 >0 


for all 


HEt, MmEVy(t);s Yo Vy (4). (2.61) 


This inequality, together with the conditions yo€ Vy(to), y,CVy(t1) and 
t€t, specifies 2n+1 boundary conditions for the system of differential 
equations (1.3), (2.4), (2.5) and a finite value of the argument f,. 

These boundary conditions can be written in a more concrete form if 
the particular sets t, V,(t1), and V,(¢)) are defined explicitly. 

In our previous example, when the sets were identified with fixed points 
in space, inequality (2.60) did not impose any constraints on the vectors 
(fo), (ti), since 


Ayo= Ay, = At, =0. 


Another common type of boundary conditions corresponds to the following 
case: Yo is a fixed point, the set V,(t;) coincides with the entire space Y, 
i.e., there are no constraints on y,, and the set t corresponds to the 
semi-infinite interval (to, 00). In this case, for inequality (2.60) to be 
satisfied, the coefficients before Ay, and Af, must vanish, i.e., 


Wy=-=, ee 


ay) , 


H(t, y(t), was. (2.63) 


These equalities define the +1 missing boundary conditions for the 
system (1.3), (2.4) and the time f). 

When investigating the sufficient conditions for a minimum, the condition 
of a minimum of (Yo, y,, t;) imposes additional boundary conditions on the 
differential inequalities for the functions ojj(¢/). They may be derived from 
the inequality 


1 on ao ob 
a0+— d= —aA a cake 
+P 2 ay5 wt s Ay + at At, + 


1 aro i,j O26 i,g 
> (=F Aways +2 aya 
oy Gara Yodye +2 sis Ayoay + 
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P i ae t 
2—— Aydt, +2 —— A4f,+ 

+ oy iat, Goh dy!ot Ayn 

ob 


ey ee) 
Ayia 2” (at, ? | > 0, 
+ syfayi yidyi + eae 1 (2.64) 
which should be satisfied for all sufficiently small non-zero increments 


Ayo, Ay:, At; when 


BET; Yo Vy (to); mE Vy(4). 


Here 


Sa +HtS; 
1 YW) 


ae = ~H (t, y(t), a(ty)); 


SO Or ao py 
ayidy, ayldays 8" 
ob OF 
dyyay) — Aayjay{” 
ap OF 
dygat, — dyiat, * 


Qo OF : par 
oy at, ~~ ayiaty +; (44) — 9:3 (4) y’ (A): 
ee ac OE 
dylan] ayfay{ + uk 
Rb OF 


(aie (ate H— yt ajyy. 


Summation over repeating indices is assumed throughout. 

For the previously considered particular case — the problem with a free 
right-hand end point — inequality (2.64) is equivalent to the condition of 
positive definiteness of the quadratic form 


arb ‘nfl bo ob : 2b ‘: 
dy ayi AyiAyit ay tat, Aydt, + (0t,)2 (a4). 
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2.1.6. Numerical aspects 


Lagrange's method thus reduces the variational problem to finding the 
extremal (y(t), Z(t)) and a vector function »(f), i.e., to solving a system of 
ordinary differential equations of order 2n with boundary conditions (2.61), 
followed by verifying the sufficient conditions of optimality on this extremal, 
e.g., by solving the differential inequalities (2.19) for the functions oi;(t). 

It is significant that the boundary conditions are separated, i.e., some 
of them correspond to the value f) of the argument, while the remaining 
boundary conditions correspond to #;. The result is a boundary-value 
problem, as distinct from the Cauchy problem, when all the boundary 
conditions are specified for the same value of the argument, f or ¢,. The 
majority of the modern computation algorithms solve a given boundary- 
value problem by constructing a sequence of solutions of Cauchy problems 
obtained by varying the missing initial conditions until the required final 
conditions are satisfied. 

Let us consider the actual procedure for solving the Cauchy problem 
for system (1.3), (2.4) by Euler's polygonal method. According to this 
method, the segment [f, t)] is divided into s parts by the points 
Ti=lo, T2>T T3>T2 -.. ts=hh. Given y(fo). (fo), we find uw(fo) from the 
condition of the maximum of the function A(#, y(fo), (to), u)and, inserting the 
result on the right in (1.3), (2.4), we obtain g(f)) and wp(to). The finite- 
difference formulas of Euler's method 


Y(t) y (ty) + Y (to) (to — 1); 
(T.)= (to) + (fo) (T—T}) 


are then applied to determine y(t) and (tz). Reiterating the procedure, 
we find 


y(t), (Ta)... 9A), (4). 


In solving the boundary-value problem, this procedure is repeated for 
various values of y(fo), (fo) until y(t), ptr), f) satisfy the final conditions 
with sufficient accuracy. 

This explains why the solution of boundary-value problems is so time 
consuming, even with modern computers. 

Various methods exist which cut down the search for the initial values 
Y(t), (fo) and often markedly reduce the volume of computation. These 
improved methods include Newton's method /3/, the method of fastest 
descent /9/, random access methods /5/. However, even with these 
methods, the solution of boundary-value problems involves significant 
difficulties, especially as none of them is absolutely reliable. 

It should be emphasized that despite all the efforts, the extremal ob- 
tained in this way may prove not to be the sought minimizing solution, 
since equations (1.3), (2.4), (2.5) only give the necessary conditions for a 
maximum of R. 

No adequate algorithms are available so far which would enable us to 
tackle the second part of the problem, namely the verification of the 
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sufficient conditions. In a number of cases, this problem is solved 
relatively simply by choosing an appropriate function @, as in 2.1.2 and 
2.1.3 above. In other cases, the solution can be simplified by using 
inequalities (2.19) for o:;(t). 


§ 2.2. HAMILTON—JACOBI—BELLMAN METHOD. 
OPTIMAL CONTROL SYNTHESIS 


In the present treatment, we choose a fixed t, and make V,(t) coincide 
with the entire space Y for /€ (t, t\] and with the point y=¥Yo for t=(o; 
F(y;, yo) is independent of yo. 

We construct the function 


P(t, y)= sup Re y, tt) (2.65) 


uBV (hy 


and try to select p(t, y) so that P is independent of y, i.e., 


P(ty) = MS cae aa Cy 9 =C(8), (2.66) 


where C(t) is an arbitrary piecewise-continuous function. 
Then P(t, y)=p(t) at any point of the space Y. Let u(t, y) be the value 
of u for which R(t, y,u)attainsa supremum at the point ty, i.e., 


R(t, ¥, u) =P, ¥) (2.67) 
and let y(f) be the solution of the system 
y=[Ie, y, Z(t, y)] (2.68) 


with the boundary condition y(0)=y; also let u(t)=a(t, y(t)). The pair of 
vector functions (y(t), w(#)) belongs to the class D and satisfies condition 1 
of Theorem 1.1. For this pair to satisfy condition 2 of this theorem, i.e., 
to be an absolute minimizing solution, it suffices to take 


F(y)+ (4, y) =const, (2.69) 


i.e., it is sufficient that for t=t, the function F+@ be independent of y. 
Thus, if the function g(t, y) can be selected so that sup R is independent of 
u 


y, or more precisely, if we can solve the partial differential equation 
(2.66) with the boundary condition (2.69), our problem is completely solved. 
Moreover, a much more general problem is solved, namely how, starting 
from any given state (to, Yo) CV, to reach the least value of the functional 
(1.1) in atime f,. To accomplish this, at every point (¢, y), starting with 
(to, Yo), we choose a control function u(t, y). If p(t, y) is known, u(t, y)is 
obtained from (2.66). The function y(t) is then found from (2.68). 

Using the terminology of the automatic control theory, we refer to the 
solution of the variational problem constructed in this way as the 
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optimal control synthesis, and to the field u(é, y) as the 
synthesizing function. In distinction from synthesis, the optimal 
control function w(t) corresponding to the given initial conditions (to, Yo) 
is called the optimal control program. 

These two forms of solution of the problem correspond to two fundamen- 
tally different methods of practical optimal control. The synthesis solution 
is implemented by a closed automatic control system (a feedback system). 
In this system, the synthesizing function is regarded as an algorithm which 
determines the response of the feedback operator to the signals from the 
sensor elements measuring the time and the current state of the controlled 
object (Figure 2.3). 

The control program solution is implemented by an open system, without 
feedback, as shown in Figure 2.4. 


Controlled object 

IS (HY) u) 
Feedback u(t) Controlled object y(t) 
“u(t,y) y=flt,y,u) 


FIGURE 2,3 FIGURE 2,4 


The method described above is in fact the Jacobi- Hamilton method 
whereby variational problems are reduced to partial differential equations. 
Equation (2.66), apart from the arbitrary function C(t), coincides with 
Bellman's equation /1/. 

Bellman's equation is thus obtained as a particular case from 
Theorem 1.1 with a function g(#, y) of special form. At the same time, 
this equation constitutes a sufficient condition of an absolute minimum. 


2.2.1. Optimal control synthesis for systems linear in 
the phase coordinates 


Let the right-hand sides of equations (1.3) and the integrand in (1.1) have 
the form 


a 


ft=>>d ak (ty! +ar(t, x) (i,k =1, 2, 25) 
f=1 (2.70) 


f= a%(t)y!+AO(t, 2), 


or in vector notation 
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f=A(t)yt+hA(t, u); P=ar(thy+A(é, x), (2.71) 


where A(t) =|la¥ || is the matrix of the coefficients (i, k=1,2,..., n). The 
set Vuis assumed to depend on f alone. 

Our object is again to minimize the functional (1.1). The conditions on 
V,(¢) are the same as in the previous section. Let 


F(y )=hy, (2.72) 


n 
where h=(j,\,,...,4,) 18 a given #-dimensional vector; y=> d,y! is the 
i=l 
scalar product of the vectors i) and y. ; 
Inserting (2.71) in (1.9), we obtain 


Rit, y, &) = (A*qy—a°)y+@r(t, ¥) +oyh(t, u)—h(E, 4), 


where A* is the transpose of matrix A, i.e., A*=|la! |]. 
Now from (2.66) 


Pt, y)=(A*ey—a%) y+o,+H (, y); 
H(t, yo. [eh (t, 4) — ACE, w)]. (2.73) 


In order to satisfy equation (2.66), we have to choose g(f, y) so that the 
function P is independent of y. 
We seek ¢ in the form 


p= pty. (2.74) 
Inserting (2.74) in (2.73), we find 
P=(o+ A*p—2%) y+ 5¢(t); (2.75) 


H(t)= sup [p(d)A(Z, w)— AE, w)]. (2.75a) 
u€V, (4) 
The vector function w(t) is defined by the equations 


b+ A* (A) p= 2° (2); (2.76) 
pe )=—A. 


The function P(t, y)=# (t) is then independent of y and O(y) =Ay+ 
+(f)y=0, i-e., conditions (2.66) and (2.69) imposed on g(t, y) are 
satisfied. The search for the function g(#, y) is thus reduced to the 
solution of a Cauchy problem for a system of linear differential equations 
(2.76) with initial conditions (2.69). If A and a° are independent of ft, 
equations (2.76) are integrable in a closed form. Given the vector »(é), 
we find the optimal control function u(t) from equation (2.76), which in‘this 
case has the form 


wp(t)A(t, Z)—A°(t, 7) =H (t). 


Let 
ht =b® u (k=0,1,..., 7), 


where #4 is a scalar (i.e., r=1) constrained by the inequality |u| <1, b*(t) 
are continuous functions on [to, ti]. We have 


# (t) =sup [p(t)6—b°lu= |p(t)b—6°|. 


bal<. 


Here b=(0!, 6?7,...,6"). We have 


u(t)=1 for all ¢ when +p(t)b—6°>0; 
u(t)=—1 for all ¢ when (t)b—6°<0. 


The equation 
p(t) b—b°=0 


defines the set of "switching points.” 

The problem of the optimal approach to the hyperplane ¢=f, from any 
position yo is thus completely solved for the particular case when 
f' (é=0, 1, ..., a) are linear functions of the phase coordinates. 


2.2.2. Algorithmic features of the method 


The Hamilton-Jacobi method reduces the variational problem to the 
solution of a Cauchy problem for a second-order partial differential 
equation for the function @(#, y). This problem has a number of charac- 
teristic features. Equation (2.66) contains only the partial derivatives Qy 
and g;, and does not contain the function @ itself. All the partial 
derivatives, except g:, enter the equation nonlinearly, because of the 
nonlinear operation of the supremum over u. Equation (2.66) is always 
solvable for the partial derivative g:; the boundary conditions are 
defined on the hyperplane f=, in the (¢4, y) space. Because of the 
last two factors, the argument ? occupies a preferential position among 
the +1 arguments of the function g~ and makes it possible to construct the 
solution of the problem in the direction of decreasing ¢, starting with ¢,. 

The solution algorithm, unlike that of the Lagrange's method, is thus 
independent of the boundary-value problem. This is one of the principal 
advantages of the method. Another advantage is that the solution of the 
partial differential equation (2.66) completely exhausts the solution of the 
variational problem: any pair (y(t), uw(t)) constructed by this method is 
the absolute minimizing solution, whereas in Lagrange's method the 
solution of the boundary-value problem for the corresponding system of 
differential equations only constitutes the first step toward the solution of 
the problem, as it remains to verify whether or not the extremal obtained 
in this way is indeed the absolute minimizing solution. Finally, an obvious 
advantage of the method is that it solves the optimal synthesis problem, 
which constitutes a much more general problem than the simple search for 
a Single pair (7(t), u(t)). 
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The greater generality of the result, however, is an outcome of the 
greater complexity of the algorithm: instead of a system of ordinary 
differential equations, as in Lagrange's method, we are dealing with a 
partial differential equation (2.66). No regular numerical methods are 
available for such an equation. Moreover, no existence theorems for the 
solutions of equations of this type are known either. 

It is therefore desirable to develop suitable methods for approximate 
optimal synthesis. A general approach to this task and one of the particular 
schemes are described in the next section. 

A shortcoming of this method compared to Lagrange's method is that its 
application is more restricted. It is not by chance that it is described in 
reference to a particular class of problems, the so-called problems with a 


free right end point. 

However, numerous problems with a set Vyof a different structure may 
be approximately reduced to this particular problem by an appropriate 
modification of the functional. For example, problems with a fixed right 
end point (fixed y, ) for a functional / are readily reduced to problems with 
a free right end point for the functional 


TH (yi— Why), 


where y!/, are the fixed values, yi, 4; are sufficiently large positive numbers, 


§2.3. THE METHOD OF APPROXIMATE OPTIMAL 
CONTROL SYNTHESIS 


2.3.1. Statement of the problem 


In virtue of the equations 
y=f(t, y, 4) (2.77) 


the value of the functional 


ty 
i=J F(t, y, w)dt+ Fly) (2.78) 


is given if the initial point (f, yo) and the control program u(t) on [to t] 
are given. Let 


d (to, Yo) = inf 1 (to, Yo, u(y}, (2.79) 
{a ()} 
where {u(t)} is the set of all the admissible control functions on [fo, i]. 
A function u(#)is considered admissible if there exists a function y(t) 


such that the pair (y(t), u(f))C D. The latter relation implies that the 
solution of the system of equations 


y=l(t y, u(t)), y (to) =yo (2.80) 
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for all ¢€ [fo, t1] belongs to V,(#) and u(t)CVu(é, y(t)). It is assumed that 
{u(t)} is not empty. In virtue of the definition of a lower bound, there 
exists a sequence 


(ue()} ¢ (4), 
such that 
T {to, Yo, Ua (t)J—-d(to, yo) for a—oo, 


For every fixed initial point (4, yo), we have to select such a sequence 
from a certain given set Vy in the space (tf, y). In other words, we have to 
solve the previously stated problem for all tf and yoCVyo. 

Let fomm be the minimal ft) corresponding to the set Vyo. 

The set V,(t), ¢€[fo, ti], in each of the resulting problems is defined as 
follows: each ¢€[fomm, 4] is assigned to a set V,,(#) and it is assumed that 


Vy (to) =Yo CVyi (40); 
Vy (t) = Van (t) for ¢ € (bo, t)]. 


Our problem is solved once we have constructed a minimizing sequence 
of synthesizing functions ua(t, y) t Cltomm ti, y€ Vyi(t). A sequence uz (t, y) 
of synthesizing functions is said to be a minimizing sequence if the sequence 
of the control functions 


Ualto, Yo, t) =ue [t, Fa(t)] 


belongs to the set of admissible control functions and constitutes a 
minimizing sequence for any to, Yo €Vyo. Here yz (t) is the solution of 
system (2.80) with the initial conditions y(to)=Yo for u=ua(t, y). 

The problem is finally stated in the following form: in a region Vyo of 
the (n+1)-dimensional space (t, y), construct a minimizing sequence of control 
fields {ua (t, y)} for the functional (2.78). 

Every element of this sequence will be called an approximate optimal 
synthesis. The degree of approximation will be measured by the number 
Aa= SUP [1 (to, Yo, Ha(tor Yor #)]—4 (to Yo)- 

(to¥d€ V yo 

Evidently, a. —0Ofor a—oo, 

The following theorem enables us to estimate the accuracy of the 
synthesis without solving equation (2.65). 

Consider a function g(t, y) which is continuous and differentiable for 
£€ [fomm, fi], ¥EVyi(t). Given @(t, y), we can construct the function 


Ret, yu=ey(ty) Ff (t.y.4)— Pty“) +4. (2.81) 


Let &(t, y) be the control function for which R attains its maximum 
value on Vi(t, y), i.e., 


Rit.y.wlhy))= sup R(t,y,2) = P(t,y); (2.82) 
weV,, (ty) 
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© (y)=F (y) + (41,9); 
g . (2.83) 
m= inf ®(y); 
y Vy (tr) 
Z(t, to, yo) is the solution of the system 
y=S(yulty) y(t)=Yo 
and (2.84) 
u (¢,to,Yo)= u (¢,y (4, £0, Yo))- 
Let for all (to, Yo)CVyo and té (to, t;) 
y(t, tor Yo) E Vy(t). 
Let further 
t 
z=} | sup Pty)— inf P(ty)|adt+ sup O(y)—_ inf Sy). (2.85) 
tonin YeY yO y(t) y(t) y eV y(t) 


Theorem. The functional (2.78) over the field of control functions 
a(t, y) satisfies the following estimate: 


[7 [l0, 40.2 (4)] — 4 (lo, Yo) <A (40) <A. (2.86) 
Here @(t)=@(f, §(t)), f(t) is the solution of (2.80) for u=H(t, y). 


Proof. Consider the functional 


L= | Rie ya eyeldt—0(l.4) +9 (nt Fly (2.87) 


defined over the set E of the independent pairs of vector functions (y(t), u(t)). 
This functional has the following property: onthe set D C E of the pairs 
(y(t), a(t)) satisfying equations (2.80), L=/ for any (4, y). 


Let 
ly (),4 (O)=Lly Ou (4.4 49s (2.88) 
f= inf TZf[y(#),u(d)]. (2.89) 
(y(t) WEE 
We have 
~ ‘ 
T= —J F@dtteltnyd+m, (2.90) 
where 
p(t)= sup R = P t 
p(t) iad legs u] bale i, ¥] (2,91) 


Seeing that 
Rit, y, UOI=P Ie, 9}, 
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and using (2.85), (2.87), and (2.90), we obtain 


Zly (4), iT [—-P(. 9) +e )] dt +O [gy] —m <a (2.92) 
Since 
(y(4), u(Q)ED, 
we have 
T y(t), 4 (I=! [0,Yor% (2)], 
and thus 


I to, Yost QI—-T<A. (2.93) 


Since DCE, we see that d(to, yo) >i and, using (2.93), we find 


17 [lo Yo.#(4)] — 2 (to, 40) |< A. (2.94) 
Q. E. D. 
Corollary. Consider a sequence ¢ (f, y) such that 
A. =A(@a ) +0 (2.95) 


for a—-oo. The sequence of the control fields ie (t, y)is then a minimizing 
sequence. 

If a particular construction of the sequence @a(t, y) satisfying (2.95) is 
given and the paths y.(t) corresponding to qg.(¢, y) are seen, in virtue of 
(2.80), (2.82), and (2.83), to satisfy the condition 


Yya(t, to, yo) EVy(t) for t€ [to, ti], (fo, Yo) € Vy (to), 


the corollary implies that a regular algorithm for the construction of the 
control synthesis u(t, y) is also given, which for sufficiently large @ is as 
close as we desire to the optimum control synthesis. Moreover, the 
theorem provides a specific estimate (2.94) of the closeness of the synthesis 
ua (tf, y) to the optimal synthesis. 

In practice, the proposed method of construction of the minimizing 
sequence u(t, y) is implemented as follows: the sequence g.(t, y) is 
constructed so that A. —0 for a—-co, and condition (2.94) is verified a 
posteriori, after the construction of the synthesis u4,(f, y) . For this 
method of solution to be successful, the set V,,(¢) should correspond to a 
sufficiently large region of the space Y. Also note that if the set Vy: (¢) 
in the initial statement of the problem coincides with the entire space Y, 
it should be replaced by a bounded region to permit searching for the 
supremum in (2.91) in practice. Vy:(t)should remain sufficiently large so 
that condition (2.94) is satisfied. 

Remark. The construction of the sequence 9 (t, y) satisfying (2.95) 
may be considered as an approximate method of solution of the partial 
differential equation (2.82), if @a(t, y) is convergent in some norm. In our 
formulation, however, the question of the convergence of qg.(t, y) does not 
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arise. The sequence 9a(4, y), and hence @(t, y), may have no limit. The 
only requirement is that (2.95) be satisfied by the exact solution and that 
A. be sufficiently small for some a when the solution is approximate. 


2.3.2. Construction of a minimizing sequence 


As a particular example of the construction of a minimizing sequence, 
consider the following technique. g(t, y) is defined as a polynomial with 
known coefficients p(t): 


I 


iA ly a 
a y= & (v' (Ss (y?)’* ( . > Pi anealy cows). . (2,96) 


1,70 


Expression (2.96) for @. contains a=J,-l) ... l, arbitrary continuous 
functions ‘pi,1,...1, (f) which are selected so as to minimize A. in some way. 
To this end, we define N supporting curves y=ys (¢) which are arranged 
in V,(t) for each f in the following manner. The range of each variable 
y' corresponding to V,(f) is partitioned into segments by laying off 
points on the y! axis, 4, points on the y? axis, etc. Hyperplanes 
perpendicular to the corresponding axes are passed through these planes. 
The intersections of all these hyperplanes give N=i-h ... ly, tabular 
points. Repeating the same construction for every time ¢ and ensuring 
continuous time-variation of the coordinates of the tabular points, we 
obtain a family of supportingcurves y=ys(t). In practice, the construction 
of supporting curves should make use of families of curves which have a 
convenient analytical description, such as the family of straight lines 
y(t)=A,t+A2, the family of parabolas y=Aj,t?+Aot+Ap, the family of 
polygonal lines, etc. 

Inserting the expression for @, (tf, y) in (2.82) and demanding that (2.82) 
and (2.83) are satisfied on the supporting curves only, we obtain the following 
system of differential equations and boundary conditions for the functions 


y(t): 


q, fl, 


D4 (49) (3, (45)" ( SS Wiigus 01g") = 


i= inn i,=0 
= K (t)— 2 (t, yp,%y (Eye): (2,97) 
® (yp (¢))) = F lye) +¢ (tye (=A, (2.98) 


B=0, ty, fo. fe. 


Here 9(t, y) is defined by (2.96), and g,; (4 y) is the partial derivative 
of (2.96) with respect to y’. 

From the point of view of numerical work, the arbitrary function of 
time K(?) and the constant K; are conveniently set equal to zero, K(/)=0, 
K, = 0. The number of equations (2.97) then coincides with the number of 
reference curves. System (2.97) comprises N first-order linear ordinary 
differential equations in the unknown functions p(t). The functions yi (t) 
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entering the coefficients are known. We recall that the subscript B 
identifies a given vector function ys(#), and the superscript i specifies the 
particular component of this vector function. The initial conditions for 
(2.97) are defined by a system of N linear algebraic equations in M unknown 
functions *p(f;). The problem of solving (2.97) for the derivatives is 
equivalent to the problem of interpolation of the function — #(t, y, My), defined 
by its values on the supporting curves using the polynomial (2.96). Similar- 
ly, the problem of solving (2.97) is equivalent to the interpolation of the 
function [—F(y1, t:)] defined by its values at the points yj (t) using the 
polynomial (2.96). 

Our algorithm is built to cope with these problems, since, as can be 
shown, they are reduced to the solution of a series of one-dimensional 
interpolation problems using polynomials of degree /,-l,...,/n or, in other 
words, to the solution of a system of linear algebraic equations of order 
h, lz, ..., tn with non-degenerate matrices (Vandermonde matrices). 

The solution of (2.97) with the appropriate boundary conditions yields 
the functions tpp(/), which define g(t, y) and hence the approximate control 
synthesis &(t,y). The synthesis (4 y) obtained in this way evidently 
satisfies the estimate (2.94). 

In particular problems, to evaluate the estimate (2.95), it is often 
convenient to supplement the integration of (2.97) for (fo, 41) by a simul- 
taneous solution of the equation 

E= sup Pit, y)— inf P(é,y) (2.99) 
) 


WV ytt) yeVyt 


with the following initial condition for t=f,: 


t(4,)= sup &— inf &, 
(4) pers yV yl) (2.100) 
Evidently, 
A=E (ty). 


In the majority of publications on many-dimensional interpolation, the 
interpolation polynomial is given explicitly or implicitly in the form (2.96), 
and the reference points are arranged as described above. In /10/, the 
interpolation problem is solved in general form and general expressions 
for the polynomial coefficients are given. These expressions, however, 
are very unwieldy and it is not clear to what extent they will be useful in 
practical numerical work on computers and how favorably they will compare 
with information given in the original implicit form by equations (2.97). 

In numerical solution of (2.97), the derivatives should be determined at 
every step from explicit and implicit expressions. The number of steps 
should be sufficiently large. The example that follows shows that a slight 
modification of the original equations may enable us to choose the reference 
curves so that the elements of the coefficient matrix of (2.97) are either 
constants or simple functions of time; the elements of the corresponding 
inverse matrix can be determined beforehand, in the form of constants or 
simple functions of time. In this case, the determination of the derivatives 
at each step involves computations using fairly simple formulae. Since 

the matrix inversion is performed only once and thus hardly affects the 
total computer time, the exact form in which the original information about 
the function w(t) is specified is of no particular consequence. 
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Let us summarize. We described an algorithm for the approximate 
construction of an optimal control field ue (t, y). An estimate of the 
closeness of the approximate synthesis u. (t, y) to the optimal synthesis is 
given by expression (2.94). 

The algorithm comprises the following stages. 

1. a reference curves ys (t) are defined, i.e., a points in a region 
V,(t1) of the space Y for every fixed t€[to, 4]. 

2. The Cauchy problem is solved for system (2.97) of ordinary 
differential equations with initial conditions (2.98). System (2.97) should be 
solved numerically in the direction from 4 to fomm. While constructing the 
right-hand sides of (2.97), we determine the synthesizing function w. (tf, y) 
from (2.82). 

3. The number A. is computed from (2.95) and inequality (2.86) is then 
applied to determine the closeness of the synthesis w (t, y) to the optimal 
synthesis. Ifthe result is insufficiently close, a larger a is selected and 
the procedure is reiterated. 

Example 1. Construct the optimal synthesis for the system y=u in 
the region t<t#, of the ¢, y plane from the condition of minimum of the 
functional 


ty 


[=| wdt+ry(t), D0. (2.101) 


fo 


Because of the increment Ay;2, the functional falls ina sufficiently small 
neighborhood of y(t,;)=0 for sufficiently large 4. We have 


R(t,y, w= Fyu—uwr+e,; u(ty)="ey (ty) 
PlLy=supR(AywH—= he tes (by) = ae, 


The function F(y)=Ay? is a quadratic polynomial, and the least number 
a for which (2.98) may be satisfied is therefore a=2. Let us now construct 
an approximate synthesis u2(t, y). We have 


92 (Ay) (4) y+ de (2) ys 
Po= $y (4) + 2ho (2) y. 


The curves y~ya (¢t) (B=1, 2) are chosen as the straight lines 
Y= 0; yo g= +1. 
1. System (2.97) has the form 
‘ P 1 I 
htht+a [h+2eP—Z %¥=0; 
(2,102) 
. : 1 1 
—Kt+h+ |Z [4—-2P— ZT Y=O 
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or 
bth=0; b+h+hth=0. 
The boundary conditions (2.98) have the form 
tpi (£1) =0; rp2 (41) =—A. 
The solution of system (2.102), (2.103) is 


wQ=0; oQ=-, 


f+pr—e- 
Further, 
- po(t y= aay ae er ; 
M(ty= — Fy re; ; 


Check the closeness of wu2(t, y) to the optimal synthesis: 


P,(t.y) = (tay + bay? = 142 F bal y? = 0, 


i.e., the second approximation coincides with the exact solution. 


Example 2. Construct a synthesis for the system 


you |uj<i 


(2.103) 


(2.104) 


(2.105) 


(2.106) 


in the region 0</<0.5, |[y|<1lof the (é, y) plane which will be optimal in 
terms of ensuring minimum deviation from zero of the coordinate y in the 


measure 


t, 


I —{ yrdt. 


fo 


We have 


R(ty,u)= Pyle —y? $i | 
u(éy)=sign 9; 


Pit y=l%yl—w +P | 
# (t,y,Py)=| Py] — 


First approximation: 
pit, y) =9(t)y- 


Let ys ()= +1, B=1, 2. 
System (2.97) is written in the form 
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(2.107) 


(2.108) 


and the initial condition (2.98) takes the form ~(t,)=0. We have 
y()=0; Pi(ty)= — ys 
05 0.5 
=f |sup Pi (y) — inf P(y)| d= f dt=0.5. (2.109) 
é 
Any control function w(t, y) is possible by (2.108). The first approxi- 
mation in this case thus does not produce a synthesis. Estimate (2.109) in 
this case indicates that for any admissible control function, the deviation 


of the functional from its minimum value does not exceed 0.5. 
Second approximation: 


$o(t y= (2) y+ Ha(e) y*; 
Poy = 4) + 2hoy. 


Let 
yi (t)=0; yo3 = 41. 


System (2.97) takes the form 


bith tlt hl [il 1=0; rey 
— $+ b+ | 4 — 22)—-] 4, | -—1=0. 


Initial conditions (2.98) take the form 
$1 (4) =O; bo (t:)=0. 
This system has the solution 
by (2) = 0; y(t) = 1/2 [e2'-4) — 1]. 
Furthermore 
Uy (t,y)=sign 2, (t) y= — Sign y, 


It is readily seen that the synthesis u2(t, y) coincides with the optimal 
synthesis. Let us check estimate (2.94). Using the above relations, we 


find 


Po(ty)=(1— e%—4) (| y | — 9’); 
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Despite the fact that u2(t, y) is a strictly optimal synthesis, the estimate 
is not zero, unlike that in Example 1: the a priori estimate in this example 
is thus too high. 


Bibliography for Chapter II 


1. Bellman, R. Dynamic Programming. — Princeton Univ. Press. 1957. 

2. Boltyanskii, V.G. Matematicheskie metody optimal'nogo upravleniya 
(Mathematical Methods of Optimal Control). — Nauka. 1966. 

3. Vinokurov, V.A. Obobshchennyi metod N'yutona dlya resheniya 
kraevykh zadach (Generalized Newton's Method for the Solution of 


Boundary- Value Problems). — Kosmicheskie Issledovaniya, No. 4. 
1965. 

4, Gantmakher, F.R. Teoriya matrits (Matrix Theory). — Nauka. 
1967. 


5. Demidovich, B.P. andI.A.Maron. Osnovy vychislitel'noi 
matematiki (Elements of Numerical Methods). — Nauka. 1965. 

6. Krasovskii, N.N. K teorii optimal'nogo upravleniya (Optimal Control 
Theory). — Avtomatika i Telemekhanika, 18, No. 11. 1957. 

7. Letov, A.N. Analiticheskoe konstruirovanie regulyatorov (Analytical 
Design of Regulators), — Avtomatika i Telemekhanika, 21, No. 4—6. 
1960; 22, No.4. 1961; 23, No.11. 1962. 

8. Moiseev, N.N.. Metody dinamicheskogo programmirovaniya v teorii 
optimal'nykh upravlenii (Dynamic Programming Methods in 
Optimal Control Theory). — Zhurnal Vychislitel'noi Matematiki i 
Matematicheskoi Fiziki, 4, No. 3. 1964; 5, No.1. 1965. 

9. Eneev, G.M. O primenenii gradientnogo metoda v zadachakh teorii 
optimal'nogo upravleniya (Application of the Gradient Method to 
Optimal Control Theory). — Kosmicheskie Issledovaniya, No. 5. 
1963, 

10. Stojakovic. Solution du probleme d'inversion d'une classe 

importante de matrices. — C.r.Aca.Sci., 246:1133. 1958. 


55 


Chapter II 
DEGENERATE PROBLEMS. SLIDING CONTROL 


In Chapter Il we assumed that an absolute minimizing solution (y¥(4), u(¢)) 
of the functional (1.1) existed inthe class D of the admissible pairs of vector- 
functions y(#), u(¢#). This assumption is not universally true, although it is 
valid for numerous important problems. The class D may contain no 
absolute minimizing solutions. There always exists a sequence 
f{ys(t), us(t)} CD, however, such that 

M(Yysts)— d,d= inf J [ye] 
So (y(#).u(t)) ED 
(a minimizing sequence). 

If a minimizing solution (¥(t), Z(f)) CD does not exist, the functional 
(1.1) is minimized by finding a minimizing sequence. Once a minimizing 
sequence has been found, we can approximate as close as desired to the 
optimal control, always remaining in the class of admissible functions. 

A construction of a minimizing sequence for the classical object of 


variational calculus — the functional f f(t, y, y’)dt — is described in /4—6/ 


(see also Supplement). The main feature of this construction is that the 
sequence of paths ys(t) in the phase space goes to some function y(é) (the 
zero closeness function), whereas the sequence of control functions us() 
has no limit, going to infinitely frequent switching between several (two 
for a plane problem) fixed control functions up (!). These terminal 
functions y(f) and ue (t), B=1, 2, fully describe a minimizing sequence. 
Sequences of this kind define what is known in automation theory as 
sliding control. Technical examples of a minimizing sequence are 
provided by the "intermittent thrust control" of an aircraft ensuring 
maximum range with the engine being switched on and off with the highest 
possible frequency (see Supplement) of the optimal pulsating punching schedule 
with the press operating at the highest possible frequency /7/. 

In this chapter we will describe some methods of solution of variational 
problems for the case when no minimizing solution exists in the class of 
admissible paths, and investigate the properties of minimizing sequences. 
in terms of the corresponding algorithm, these problems are part of a wider 
class of so-called degenerate variational problems, whose solution involves 
a number of specific difficulties. In particular, the methods of the previous 
chapter are ineffective for the solution of degenerate problems even if the 
minimizing solution is contained among the admissible pairs y(t), u(t). 

The theory developed in this chapter is in fact the theory of degenerate 
variational problems. 
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Let us first consider a particular case of the variational problem 
advanced in § 1.1, which illustrates the specific difficulties encountered 
in the solution of degenerate problems. 


§3.1. A PARTICULAR PROBLEM 


Let f‘,=0,1,..., 2 be functions of the one variable u, let V,(t) coincide 
with the entire space Y for any t€ (fo, ¢:), and let Vz be a closed region 
independent of tf and y. The end points are fixed, ¢t=0; t=tir; y(0)=0; 
y(ti) =y18. 

To solve the problem, we make use of Theorem 1.1. We have 


R(t,y,t)= oi (t,y) F! (4) — fF (ue) + % (4,9). (3.1) 
If we take g(t, y)=py, where p=const, the function 
R=R(u)=,f' (u)— f(a) (3.2) 


is independent of y and the sufficient condition of an absolute minimum of 
the functional on the pair (y(t), u(¢)) € D takes the form 


R(u)= ep Re): (3.3) 


Let R(u) have a single supremum point # for any p. This point is 
clearly independent of ¢. Then, by Theorem 1.1, the pair y(t)=f(Z)t, @() 
is an absolute minimizing solution of the functional. The vector p=const 
is defined by the condition that the straight line (f¢, 7(f)) passes through 
given points (0, 0) and (1, yif) of the ( ft, y) space. For simplicity, a 
three-dimensional space is assumed, i.e., n=2. If the position of the 
terminal point ( fig, yi) is altered, the vector p, the optimal control 
function @, and the direction of ¥ of the straight line y(t) all change. 

Now suppose that there exists a vector p=(p, 2) such that R(u) has at 
least n+1=3 suprema uw), U2, us where 


1 f(a) f?(u) 
1 f'(u2) f? (us) 
1 f' (as) f? (43) 


#0. (3.4) 


Inserting the control functions 4, uz, uy; in the equation y=f(u), we obtain 
three linearly independent directions in the three-dimensional space 
(t, y', y2), which may be defined by the vectors 


ap=[1, S'(4—), f?(zp)], B=1, 2, 3. (3.5) 


Any vector in the ( ¢,y) space, including the vector a(t, yl, 4?) may 
be represented as a linear combination of the vectors ag: 


a=y'ae, P=1, 2, 3, 


where y!',y?, y> are some numbers. 
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In other words, the origin may be joined to any point (ti, yj, y?) bya 
polygonal line consisting of straight segments parallel to the vectors ag, . 
This polygonal line belongs to the class of admissible paths if and only if 
the coefficients y!, y?, y? (the projections of the vector a onto ag¢) are 
non-negative: 


vy? >0, p=1, 2, 3. (3.5a) 


Condition (3.5) has an obvious geometrical interpretation: the terminal 
point (fit, yl y%) can be joined to the initial point (0, 0, 0) by a polygonal line 
consisting of segments parallel to a, a, a3, which belongs to the class of 

admissible paths only if this point lies inside 
the trihedral solid angle (cone) » with its apex 
y? a at the origin, spanned by the vectors 4@, ao, a3 
(Figure 3.1). The situation is entirely 
a similar in the n-dimensional problem. 
Each of these polygonal lines is an absolute 
@e W minimizing solution. It corresponds to a 
t,t piecewise-constant control function 2 (?) 
/ taking on the values u, uy, u3. Indeed, any 
5 ye such pair #(f), Z(t) belongs to the class D of 
Jy @y admissible solutions and satisfies all the 
conditions of Theorem 1.1. It (4, y)E€ o, 
FIGURE 3.1 there are obviously infinitely many such 
minimizing solutions. We can thus always 
select a minimizing solution consisting of the 
least number of straight segments (and the least number of controls), 
If the point (tif, yif) lies inside the angle », this number is n+1=3; if it 
lies on one of the faces of the trihedral angle, n=2, and finally if it lies 
on one of the edges, n=1. 

Thus, the minimizing solutions of the functional are of fundamentally 
different form, dependeing on the properties of the function R(u). Indeed, 
if R(u) has a single supremum point @ over u€ V,, the minimizing solution 
(y(t), u(t)) CD is unique. To find this minimizing solution, we have to solve 
avery simple boundary-value problem: find a vector p= (1p, 2) such that 
the straight line y=/[u(p)]¢ passes through the point (¢1f, y};, y?-). If, however, 
a vector » exists such that R(u) has three supremum points ug, satisfying 
the condition (4, yif) Co, the functional has infinitely many minimizing 
solutions, which are polygonal lines. To find the constants 1, tp2 and the 
control functions ug, we do not have to solve any boundary value problems: 
they are independent of the terminal point (‘if yif)€ m and are determined 
from the finite relation (3.3). 

Condition (3.3) in our case contains five equations: three necessary 
conditions of a local maximum, e.g., Ru(us)=0, B=1, 2, 3, if ug lie inside 
V1, and two necessary conditions of an absolute maximum at each of the 
points us: 


R(u1) = R(up), P=2, 3. (3.6) 
The other conditions included in (3.3) are inequalities. The number k 


of finite equations (3.6) defining the vector p will be called the 
degeneracy ofthe variational problem. The degeneracy k can be 
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defined as the number of linearly independent vectors ag, (the dimension of 
the solid angle w) minus one, The above cases correspond (in the same 
order) to k=0 and k=n=2., 

An intermediate case, k=l, is also possible. Here, for some p= (11, 2) 
R(u) has only two suprema, u,and uz, suchthatthe vector a= (ff, yl, y;;) may 
be written inthe form a=y'ds, where dg =[l, [!(ua ), f2(ue )J, y? SO, B=1, 2. In 
thiscase, wis aplane angle spanned by a,;and a2. The functionalhas infinitely 
many minimizing solutions, and the corresponding paths inthe (f, y) space are 
polygonal lines consisting of straight segments parallel to a; and a2. In 
contrast to the case k=2, the constants 1, 2 and us depend on the position 
of the terminal point ( fit, yy, y,? ), although they are "less sensitive" to this 
factor than for k=0. For k=0, two boundary conditions had to be satisfied, 


A fifa, =yln i=1,2, 


and for k= 1 only one condition is to be satisfied: the vectors a,a, and 
@=(ty,4i,4?¢) Should be coplanar. 


§ 3.2. SUFFICIENT CONDITIONS OF AN ABSOLUTE 
MINIMUM AND THE CONSTRUCTION OF A 
MINIMIZING SEQUENCE 


Let us return to the general variational problem assuming a fixed f¢, . 
The sufficient condition of an absolute minimum for this problem is for- 
mulated by Theorem 1.1. 

Consider a minimizing sequence {y;(t), us(f)} ( D and a function g(¢, y) 
which satisfy all the conditions of the theorem. Let us analyze the structure 
of this sequence depending on the form of the function R(y, u, t). Let 
(y(t), @(t)) be the point of the (n+r)-dimensional space (y,u) where R(i, y, u) 
attains its maximum for a given ¢. 

Let there be one such point on some segment (1), t2)€ (to, #1). Let further 
y(t) be a bounded, continuous, and piecewise-differentiable function and 
u(t) a bounded and piecewise-continuous function. Then, by (1.11), 
everywhere on this segment 


ys(t) > y(f), and us(t) > u(). (3:7) 


Since {ys(#), us(1)} ( D, a necessary condition for the convergence (3.7) 
is that y =/(t, ¥, @) almost everywhere on (t, 12). Thus, for those points 
where the function R(t, y, u) has a single supremum point (y, uz), the 
minimizing sequence converges to (y(t), w(¢)), and this pair of vector 
functions should satisfy equations (1.3). 

Now suppose that on the segment (t, tz) C (fo, t:) there are m different 
control functions us such that R(f, y,u) has a supremum at every point 
(7, up) of the space (y, u) (7 is the same for all the suprema), i.e., 


Rt, y@), ae) =e), B=1,2,....m, (3.8) 
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and let there be k+1<m control functions ug, B=1, 2,..., +1, such that 

the vectors as =[l, f(t, 7, u®)] of the (¢, y) space are linearly independent. 
Then, by (1.11), ¥s(t)—-y(t), whereas the control function ug(t) does not 
necessarily have a limit. It is sufficient if this control function "oscillates" 
between ue(t). The latter signifies that for any e>0 and a sufficiently large 
terminal S, one of the inequalities 


]us(t)—ue (t)|<e, B=1, 2,..., m, (3.9) 


should be satisfied everywhere on (t), t2), with the exception of a set of points 
whose measure goes to zero for S— oo, 

The limit function 7(f) in this case is not necessarily a solution of (1.3) 
for any u(t). It is necessary, however, that on (1, t2) the vector a(t) =(1,¥) 
of the space (/, y) belongs to the angle off, 7(/)], which is a convex envelope 
of the vectors 


ap=[1, SUE, y, ue), FUE, y, Up). F(t, Ys He), 


P=1, 2,..., e41. 

In other words, it is necessary that there exist some functions y® (ft), 
p=1, 2,..., &+1, such that on (ti, t2) the vector (1, ¥) is representable in 
the form 

a(t)=(1,y)=y?-a9; vP(d)>0, (3.10) 
y=V (t)F (t,ysma), (3.11) 
or 
R41 
DX v=; D0, B=, %..,8 41 (3.12) 


(summation over f is implied). The functions y® (f) are piecewise-continuous. 
Asaresult, y(t) is piecewise-differentiable. 

The higher the dimension &+1 of the angle w, the weaker the ''coupling" 
imposed by (1.3) on the vector function y(t). For k=n, the condition 
(1, 7)€ » imposes simple inequality constraints on y(t), instead of equations 
(1.3). For k=0, conversely, j(t) should be the solution of the equation 


y=S lt, y, ue), 


where ug is any of the maxima of the function R for a fixed y=g(t). 

We will now show that if (1, ¥7)€ @ on (ti, 1), we may indeed construct 
a sequence {ys(t), us(t)} C D satisfying (1.3) and (1.11) on (t%, t2). Moreover, 
we will actually construct this sequence. We partition the segment (t, 12) 
into S intervals by the points mn=%<fi<...<tS <t. This partition is 
expected to satisfy one condition only: for S—-co, 


As = max|f+1—21|—0, y=0,1, 2, ..., S—1. 


for S+eo 


Let = 
ys (t= y(t) + 0(As), 
0 (4s) 
As 
Every point (#1, y,(¢)) in the ( ¢, y) space is joined with the adjacent 
point ( /1+!, y, (t+!) ) by a polygonal line consisting of straight segments 
parallel to the vectors 


when —0 for As > 0. 


aa(tv=(1, f[A,y(e), ap (I). (3.13) 
P=1, 2,.., A+1. 


The polygonal line y,(f) constructed in this way is a solution, to terms 
of the order O(As), of equation (1.3) with a piecewise-constant control 
function ugs(t), which is equal to ug(ét) over the part y*(f")A, of each 
interval Ay=¢1t!—?1, If (1.3) is to be exactly satisfied, the straight segments 
parallel to the vectors a®(t1) should be replaced by the solutions of the 
equations 


y=S [t, Y, Up (*)]. 


The sequence {ys(f), us(t)} constructed in this way satisfies all the 
conditions of the theorem on (1, t2): it satisfies equations (1.3) and for any 


t € (m1, 12) ’ 
Rg, Ys, Ws) ae), 


where 
LA=RIt, yO, 49 (8) 


is bounded and continuous almost everywhere on (t;, t2) in virtue of the 
properties of the functions R, y(t), us(t). Thus, if this sequence satisfies 
the conditions of the theorem on [f, t)], i.e., beyond the limits of the 
interval (ti, t), it is a minimizing sequence. 

The minimizing sequence constructed by this method has much in 
common with the sliding control functions of relay systems in automatic 
control theory, which switch from one state to another. For this reason, 
it will be referred to as the optimal sliding control. 

The limit function g(t) of a sequence of optimal paths ygs(f) will be called 
the zero closeness function of the sliding control. In the limit, the sliding 
control may be treated as "sliding motion" along the path ¥(¢) with infinitely 
frequent switching between different control functions ug (f). 

The control switching may occur in any order, as long as the 
approximating path yg(¢) remains in a sufficiently small neighborhood of 
the path y(t). 

The optimal sliding control is fully defined by the zero closeness 
function #(t) and a set of k+1 basis control functions u(t), B=1, 2,..., R+1. 
To find this control, we have to find its defining characteristics. 

The least number / of switchings of us(t) on every small interval 
A, = (é, 21+!) € (t1, t2) will be called the branching of the optimal 
control on (11, t2). Branching can be defined as the least number of 
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straight segments which make up an elementary polygonal line in the (¢, y) 
space minus one. We have an obvious inequality [<k. 

Note that in our construction of the minimizing sequence, we dealt not 
with the total number m of the suprema (J, ug) of R(t, y, u), but with the 
number k+1< m of the basis control functions ug such that 


R(t, 9, 4s J= p(t), B= 1, 2,..., R41, (3.14) 


and the vectors ag[l, f(us)] in the (¢, y) space are linearly independent. 
Condition (3.14) is equivalent on (1%, t2) to condition (1.11) if the sequence 
[y.(t), us(t)] is constructed by the above method. 

The number & is called the degeneracy of the solution of the variational 
problem on (t;, t2). This number, as we have noted before, characterizes 
to what extent the zero closeness function is "independent" of the constraints 
(1.3). The control functions ue (¢) defining the basis vectors ag, B=1, 2,..., 
k--1, which span the angle w(t) will be called the basis functions of the 
optimal sliding control. 


Let 
Aft, yO, oe, l=eOF lA 9O, AI-P  ), (3.15) 
where 
eA)=ylt, y(d]- (3.16) 
By (3.14), 
Altg(t), ¢O, 4l=Kn()= sup Ht, 9, $2), al, (3.17) 
ueV (ty) 
B=1,2,...,k+1 
or 
oft, yl), uel—S°Ut, 9 (4), to] =r), (3.18) 
P=1, 2,..., A+]. 


Since the vectors as are linearly independent, the matrix 


1 f(y), £7 (ar) 
Sete See Sort (3.19) 


is of rank k+1. This matrix coincides with the matrix of the coefficients 
before pi, pin (3.18). 

A degeneracy of & thus corresponds to the existence of k& independent 
finite equations (3.18) to be satisfied by the vector w(t) and the function 
wi(t). If on (ti, t2) the solution has a degeneracy k, then a sliding control 
with branching [=k, a zero closeness function y(t), and basis control 
functions ua(¢t), B=1, 2, ..., k+1, is optimal on this interval. 
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Is this sliding control unique or do there exist other optimal sliding 
controls with the same zero closeness function but a smaller branching on 
(t1, t2) ? A sufficient condition for the existence of such controls is the 
existence of an angle o\[t,7(t)]of dimension &:+1<A+1. In particular, for 
the zero closeness function j(t) to be part of the phase path of an absolute 
minimizing solution, it is sufficient that there exists a control function 
v(t) CV. which, first, satisfies the equation ¥=/(t, y, v) on (ti, t2) (the 
angle o, in this case k,=0 degenerates into the vector (1, 7) and, second, 
satisfies the condition 


Alt, 7(t), p(t), =Ale, F(t), w(t), 4p l=mi(d). (3.20) 
For example, in problems which are linear in control functions, 


Si=Al(t.y)+ Bi(ty)u, i=0,1,2,...2, 
jJ=1,2,...7, 
such a solution always exists, and a minimizing solution (7(t), w7(t))€ D 


therefore also always exists, although along a so-called singular curve 
y(t) satisfying the equalities 


(2) BS [4,y(]— Bilt.gO)=0, (3.21) 
jJ=1,2,...,9, 


the degeneracy is k=qg. Indeed, by (3.21), the function 


H(tyO, 9), wl = $8 ju! — By +H A(t, yO)— A yA), (3.22) 
i=1,2,...,7, 
J=9+1,...7 
is independent of the control functions u/, j=l, 2, ..., q. 
Therefore, any control function o(t) = {u!, ..., uw’ }, where the first 


components are arbitrary and the last components maximize (3.22), satisfies 
(3.20). In particular, a control function with first g components satisfying 
(1.3) also satisfies (3.20). 

Irrespective of the degeneracy k&, the minimizing solution (y(t), u(t)) 
exists in the class of admissible solutions D if the branching / is zero. If 
on some (t, tz), J>0, no minimizing solution exists in D, and a minimizing 
sequence has to be constructed as described above. This solution of the 
variational problem will be called an optimal control with branching on 
(t1, T2), Or an optimal sliding control on (t, t2), or simply optimal control 
if there is no danger of confusion. 

Theorem 1.1 reduces the problem of minimizing a functional on a 
set D to the problem of finding a maximum of the function R(t, y, u) for 
every fixed ¢. Our unknowns in this problem are the minimizing sequence, 
defined by the zero closeness function y(t) and the basis control functions 
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up, and the function o(f, y). The conditions of the theorem do not prescribe 
an unambiguous choice of the function g(f, y). Making use of this ambiguity, 
as in Chapter II, we may select different algorithms based on Theorem 1.1 
for the solution of the variational problem. 


§ 3.3. GENERALIZATION OF THE 
LAGRANGE—PONTRYAGIN METHOD 


For simplicity, we assume that 7(f) lies inside Vy for all t€ (fo, t,), 
the vectors yo=Yy(fo), yi=y(t) are given, the region V, is defined by the 
inequalities 


Juv |S1, j=1, 2,..., 7, 


and the functions f! (¢, y, u), i=0, 1, ..., 1, are continuous and twice 
differentiable. 

Consider an optimal control with a zero closeness function y(t) which 
satisfies all the conditions of Theorem 1.1, i.e., equality (1.11) in our case. 
On (ti, tz) C (to, 41), this optimal control has a degeneracy k. We will 
write out all the equations which are contained in (3.14) and which should 
be satisfied by the sought functions on (t%, t2). The function @(Z, y) is 
assumed to be twice differentiable in the process. 

1, Condition (3.17) containing & equations (3.18) and r(k+1) conditions 
for a local maximum of H: 


either 
Hy=0, |ul<1, (3.23) 
or 
“a= +1, (3.24) 
where 


B=1,2,...R41; J=1,2..47- 
2. The necessary conditions for a maximum of R with respect to y: 


Ri hyt=o, SF (by Mate teh i byte)— Spy) yz =O 
or 
7 uf) —2_ (0%: = 
R (ty, )= —— (si J+A (Leu.yste) | ero (3.25) 


The index § in the first term in (3.25) signifies that the total derivative 
of »,, (4, y) is taken in virtue of the equation y=f(t, y, up). The total number 
of equations in (3.25) is (k+1)n. 

Using (3.11), we may write 


WO=— eo leg Ol=v lef. 


For every fixed i, we multiply each of the k+1] equalities in (3.25) by 
y? and add them up. Using the last identity, we obtain 


by +H (,4,4,45) = 0, (3.26) 
1=1,2,..4; B=1,2,...,2-+1. 


Let us write the necessary conditions for a maximum of R(t, y, u) on 
(t1, t2) supplementing them with conditions (3.11), (3.12) which signify that 
the sought minimizing sequence belongs to the set of sliding controls on 
(t1, Te): 


y= yf (t,y,ia); 
b+ y®H, (t,y,4p) = 0; 
A[t9.g@, B= Sup H(t,~(), y(0),2)3 (3.27) 


k+l 
DV=1; > 0; P=1,2,...241. 
Bol 


Here H is defined by (3.15) and (3.16). 

This system is an analog of the equations of Pontryagin's maximum 
principle for a minimizing solution of class D. In case of zero degeneracy 
(k=0), equations (3.27) coincide with (1.3), (2.4), and (2.5) of the maximum 
principle. 

Equations (3.27) contain 2n+r++k-r+1! unknown functions: these 
include n phase coordinates y!(t), m functions y,(t), r(k+1) components 
of basis control functions ua, and k+1 factors y# (t). The number of 


equations is also 2n+r+k+k-r+1. Indeed, we have 2n equations from 
(3.11) and (3.26), one equation from (3.12), and &+r+r-k equations from 
(3.17). 


The number of equations in (3.27) is thus equal to the number of 
unknowns. 
Equations (3.27) include k independent finite relations 


H (4,4, 9,26 (4,9,9)) — H (type, (f,4,p)) =0 ’ (3.28) 
B= 2,3,....24+1, 


which follow from (3.17). These relations indicate that the function 

A(t, ¥, , u) for fixed ¢, y, p has equal values at allthe suprema u. Hence 
it follows that the initial conditions (¢, y, p) for the 2n differential equations 
contained in (3.27) cannot be chosen arbitrarily: they must satisfy (3.28), 
and so the order of the system (3.27) is at most 2n—k. 

Conditions (3.28) should be satisfied identically along any solution of 
(3.11), (3.26) on (vi, t). Setting the total derivatives of (3.28) with respect 
to ¢ equal to zero and insering for p and y the right-hand sides from (3.11) 
and (3.26), we obtain the following system of linear equations in y?: 
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ve [Y, (3.29) 


eS (4) — Aas [41] =—H, 
a=2,3,....4+1; B=1,2,..,A+1. 


ue 
u,’ 


Here 


Jie =F (ud—S (tas FU)=F (hy te), 


etc. The left-hand sides contain scalar products of the n-dimensional 
vectors f and Hy=(H,:, Ay, ..., Hy). 

If equations (3.29), (3.12) are compatible for any ¢, y, p, they define the 
coefficients y*. However, these equations may prove to be compatible only 
for those ?¢, y, ) which satisfy certain finite relations. This will evidently 
lower the order of system (3.27) even further. 

Let k=1. Then (3.28), (3.29), and (3.12) take the form 


H (t,9,), 42) — Hf (6,4,9,21)= 9; (3.30) 


VY" [(Hy (2) — Hy (021) F (1) — Hy (ay) (F (42) — F (1) + 
ty? [Ay (2) — Ay (Hy) f (4a) — My (2) F (2) — F (41) = — Ay (ta) + A (a); (3.31) 


yty=l. (3.32) 


Collecting similar terms in (3.31), we obtain 
(vv?) [Fy (22) f (t:) — Ay (ay) F (ua) = — A, (42) + HY, (2), 


whence, using (3.12), we obtain an additional finite relation supplementing 
(3.30) 


[Fly (242) F (1) + Ae (42) — [Ay (4) F (42) + A (4) =9, 


so that system (3.27) in this case is at most of order 2n—2. 

The general procedure for the construction of a solution by Lagrange's 
method is not very clear at this stage. In principle, however, it can be 
described as follows. 

Choose an arbitrary initial vector »°=(f) and investigate the maxima 
of the function 


A (toy Yor W, H). 


Suppose that H (fo, yo, °, u) has a single absolute maximum w,. This 
means that for the particular ° chosen, the degeneracy is zero at the 
corresponding point, k=0, and equations (3.27) coincide with the ordinary 
equations of Pontryagin's maximum principle. Using these equations, we 
construct (numerically, step by step) the functions y(t), p(t), a(t). If for 
some ¢=1t, the function Alt, 7, p(t), u] is found to have several absolute 
maxima (basis control functions) u(t), ..., Um(f), m>1, and system (3.27) 
is compatible at this point, the construction of the extremal can be continued, 
in general, from this state in m different directions, 


(t. y)=(t, F(us)), 
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taking k=0, as before, and a sliding control of degeneracy k=m—1 can 

be constructed using equations (3.27), Each of these alternatives satisfies 
equations (3.27) equally well, but these equations only provide the necessary 
conditions for a maximum of R(t, y, 4). To choose the optimal alternative, 
we must rely on conditions (1,11). 

Suppose that the optimal branch on (1, #1) is a sliding control of 
degeneracy k=m—1. Equations (3.27) together with the initial conditions 
y=y(u1), p=p(t1) fully define this control. It is constructed as follows. 

Given (v1), p(vt1), we find the absolute maximum ug of H(1, u); from 
(3.29) and (3.12) we find the factors y*. Inserting these we (t1) and y? (1) 
in (3.11) and (3.26), we obtain y(t:), p(t), and applying any suitable 
numerical method of integration of differential equations, we construct 
step by step a solution of (3.27) up to t=). If y(t)s4yi¢, a different w°® is 
selected and the entire process is repeated; alternatively, for the same wp°*, 
some t €[ti, 4] is chosen and, if possible, one of the above control 
alternatives with degeneracy k<m—1 is constructed from this point. 

A characteristic feature of system (3.27) with k>0 is that its solutions 
are not unique. The number of solutions may be infinite, as Example 2.2 
in Chapter II shows (a degenerate problem with linear control), 

It should be stressed that in the degenerate case, equations (3.27) do not 
yield the necessary information even for constructing local minimizing 
solutions. With nondegenerate systems, on the other hand, the equations 
of the maximum principle are often solvable or have a finite number of 
solutions, so that in principle the minimizing solution can be found by 
constructing all the possible extremals and choosing the best alternative. 

For k>2, equations (3.29) are often solvable for y*, so that (3.27) is 
of order 2n—k, i.e., the set of its solutions contains 2n—k constants. 
Further analysis of the function R (see § 3.5) shows that the necessary 
conditions for a maximum of the second derivative of R are satisfied only 
by those solutions of (3.27) (if they exist) for which the coefficients of (3.29) 
identically vanish, so that the right-hand sides of this system are all equal. 


k(k— 


This generates ) 4% additional finite relations. If they turn to be in- 


2 
dependent, the overall order of system (3.27) does not exceed 
on—op—PLR=Y 


2 
The system of finite relations may contain less than 2k+ alee) in- 


dependent relations. 

For example, it can be shown that if the degeneracy is k=n and the 
integrand is independent of the control function and has a stationary point 
y(t) for every ¢€ (ty, #1), where y(f) is a continuous and differentiable 
vector function, system (3,27) is of order zero. 


§3.4, REMARKS CONCERNING THE HAMILTON — 
JACOBI— BELLMAN METHOD 


The Hamilton—Jacobi—Bellman method was considered in Chapter II for 
those cases when the minimum exists in D. Unlike the Lagrange method, 
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where Pontryagin's maximum principle equations are replaced by (3.27), 
the Hamilton—Jacobi—Bellman method is extended to the case of sliding 
control without any change. 

Indeed, consider the function P(t, y), (2.65), and choose g(f, y) so that 

P becomes independent of y, i.e., so that equation (2.66) is satisfied. 

As in Chapter II, the function (é, y) describing the optimal control field 
is defined by the partial differential equation (2.66) with the boundary 
condition (2.69). i 

Having found a solution @(t, y) of this equation, we obtain an optimal 
control function #(t, y) at every point (f, y), but this control is not 
necessarily unique. It is defined by equation (2.65) and in practice is obtained 
in the course of construction of Pft, y, (t, y)]. 

To obtain an optimal solution corresponding to some fixed initial 
condition (fo, yo)f, it is sufficient to integrate system (1.3), closed by 
equation (2.65), with this initial condition. If the solution of the problem 
with this initial condition contains a sliding control section in (1, t2), the 
required solution is realized automatically. Otherwise we would have ended 
up with a non-optimal solution satisfying the conditions of Theorem 1.1, 
which is impossible. 

In numerical integration, the sliding control section emerges as a 
control schedule with very frequent control switching. The switching 
frequency increases as the integration interval diminishes, and the sought 
minimizing sequence is thus indeed obtained by reducing the integration 
interval. 

We would like to stress some characteristic features of the partial 
differential equation (2.66) in this case. 

Let the function H(t, y, , u) have the form 


H=K(t, yp) +L(t, y, pe, 
where uC{—1,+1] is a scalar. The function P thus takes the form 
P=K(t,y, oy) +1L (4% oy) | +o 


i.e., the left-hand side of the partial differential equation (2.66) is non- 
differentiable. This situation is characteristic of all degenerate problems. 


§3.5. THE METHOD OF MULTIPLE MAXIMA 


Consider another method of solving degenerate variational problems, 
which differs from the Lagrange and the Hamilton—Jacobi methods. When 
applicable, it has a number of distinct advantages which will be considered 
below. This method uses a particular definition of the function 9/(f, y). 
First let us consider a particular problem of minimizing the functional 
(1.1). 
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3.5.1. The simplest functional 


Let 


t 
1=f PUt.yuat, 
prey (3.33) 
y=f(t,y,4); Ty <u<Th. 
Here y(f) and u(t) are scalar functions, 
Yo=Yot, Y= Yif- 
Let fu>0 for u€[T,, Ie] and let the region V,(t) of the admissible values of 
y be restricted to the solutions of the equations y=f(¢, y, Ti,2) which pass 
through the given initial and terminal points. 


Additional conditions may also 
be imposed on V,(?). 


At every point ¢,y€ V,(t) we find p= p(t, y) such that H= pf—/? attains a 
supremum at least for two values of u, 


A [t,y,o(ty), a= [4,9,4,9(y), 4249) =4 (49); (3.34) 


d(t,y)=_sup_ HH (¢,y,(4,y),“), (3.35) 
T,<u<T, 

and the vectors [1, f(u)] and [l, f(ue)] are linearly independent, i.e., 

f(u1) AF (ue). 


As we have seen before, relations (3.34) and (3.35) incorporate three 
equations in three unknowns 4, u2, p. Let Py<u;<Tgand w=le. Then 


A (t,9,4,4) |? = 


= oF (P2)—F (4)]— [79 (Pe) — Ff (a)] = 05 


(3.36) 
A, (a) = thn (4,)— fo (,)=0. 


Solving the second equation in (3.36) for and inserting the result in the 
first equation, we obtain an equation for 4. 
The function @(f, y) is now defined by the equality 
y= (4, y)s 
pw (3.37) 
oy=) oy) dy. 
The integral in (3.37) is indefinite, with fixed ¢t, since an additive 


constant in the expression for g does not affect the solution of the problem. 
Thus 


R (ty) = V(t.) F (t,y,4)— P(t,y,u) + | % (4,y) ey, (3.38) 


Pity) =supR=d (ty) + | w(ey)ay. (3.39) 


Maximizing P(t, y) with respect to y for every fixed t€ (to, ¢,), we finda 
solution g(¢) of the equation 
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P(t,y)= sup P(t,y)=p(2). (3.40) 
yeV y(t) 


The function y(t) in general consists of pieces of the boundary of V,(t) 
and the solutions of the equation 


R,(t,y)=0 (3.41) 


in V,(t). 
Let ¥(¢) be continuous and piecewise-differentiable on [%, t]. Let it 


further satisfy the inequality 
face y, u1) <9 <f(t, y, Ue), (3.42) 


which in our case is equivalent to the condition (1, 7)C w(é, 7(f)). Then 7(t) is 
a zero closeness function of an optimal control of degeneracy k=1 with the 
basis control functions 4, ue. 

Indeed, for every {C(t t1), the function R(t, y, u) defined by (3.38) has an 
absolute maximum at the points (¥, 41) and (J, us) of the plane (y, u). The 
problem is thus solved. 

The above discussion also reveals the characteristic features of this 
algorithm. Ina sense, it is ''antipodal'' to the Hamilton—Jacobi—Bellman 
method. Both these methods, in contrast to the Lagrange method, impose 
certain constraints on g(t, y) which should be satisfied identically in V,. 

In Bellman's method the optimal path is selected by choosing a control 
function u=z which ensures sup R, while R is maximized with respect to 


Vv 
y by an appropriate choice of ott, y). Inthe new method, on the other hand, 
a minimizing solution is obtained by choosing the point y=y in the phase 
space for every ¢ on which sup 2 is attained; the maximum with respect to 
u is ensured beforehand by an appropriate choice of g(t, y). Indeed, gis 
chosen so that at every point (4, y) there exists an angle »€ (1, j). 

The specific features of each method automatically define the range of 
problems to which they are best applied: the Hamilton—Jacobi—Bellman 
method is little sensitive (at least in principle) to constraints on the control 
function, while being extremely sensitive to constraints imposed on the 
phase coordinates, the constraints on the vector y included. For the new 
formalism, on the other hand, the constraints on the phase coordinates are 
of no significance. 

In virtue of the particular construction used, the method of multiple 
maxima is applicable only to degenerate problems (although not necessarily to 
branching control). This is atypical method for solving degenerate problems. 

The above discussion illustrates the efficiency of the method. Thus, 
instead of solving a boundary~value problem for a system of differential 
equations, whose solution is far from covering all the possible control 
programs (Lagrange's method), or solving a nonlinear partial differential 
equation (the Hamilton—Jacobi method), the method of multiple maxima 
reduces the entire problem to elementary relations. This simplification is 
made possible by the relaxation of the constraints on y(t), u(t) as a result 
of degeneracy. The condition of the strict equality of the vectors (1, 7) 
and (1, f (4 7, #)) in the (¢, y) space is replaced by the weaker condition that 
the vector (1, ¥) should belong to the angle (4, 9()). 
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We will now consider the generalization of the method of multiple maxima 
to some common multidimensional (n>1) variational problems. 


3.5.2. Systems with linear control 


Consider the following particular case of the general variational problem 
(Chapter I), The functional / is given by 


t, 
1=f F°(t,y,0) dt + F (Yo.41)s (3.43) 
fo 
the differential equations (1.3) have the form 
y= (4, y, w) =e(t, y, 0) +ALL y)w, (3.44) 
where 
u=(v, w), v=(v!, v2, ..., 077), 


and w is a scalar control function. 
The set V.(¢, y) is defined by the conditions 


vE Vi(t y), WEVu(h y); (3.45) 
yEVy(t), ¢€ [to A), (3.46) 


where V,(t, y) is a given set of points in the (r—1)-dimensional vector 
space, V,»is the segment [w(f, y), we(t, y)]. 
The vector functions g(t, y, v)=(g', g?, ..., g’) and A(t, y)=(A', A’, ..., bh") 
are continuous and differentiable for all t€ [to, ed yEV,(t), vEVo(t y)- 
Problems of this kind constitute a special class in variational calculus, 
since the standard classical necessary and sufficient conditions do not 
provide a solution for the local minimum of the functional in this case. 
Thus, Weierstrass's and Legendre's strengthened conditions /3/ a priori 
do not hold. Jacobi's condition is meaningless for these problems. 
To solve the problem, we will apply Theorem 1.1. We have 


Rit, y, v, w) =p, (g(t y, 0) +A(L y)w)— P(t, y, v) +o. (3.47) 


The function » being arbitrary to a degree, we define it so that for any 
admissible ¢ and y, 


pyh(t, y) =0. (3.48) 


Given this g, the function R is independent of w. 

Condition (3.48) is a partial differential equation for the function @. The 
general solution of this equation is an arbitrary continuous and differentiable 
function 


P=? (4,9), 
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where 


n=n(t,y); (3.49) 
nit yy=(y!, 2, ..., 4"7') is a set of the (n—1) independent first integrals 
of the system of differential equations 

Be a: 3.50 
dt =h(t,y). ( ) 


We choose 4, y as the new arguments of the function g,. This gives 


R=R,(4,N,9,0) = iq (3 syeiteacort) — F°(EY,R)+ ate (3,51) 


The problem now reduces to finding the maximum of R, for every fixed 
t€ (to, f1) on the set of points (n, ¥, v, w) in the (n+r+1)-dimensional set, 
which satisfy the conditions (3.45), (3.49) and the condition y EV,(Z) . 

Formally, the R defined by expression (3.51) is a function of the 
functional (3.43) and the following system of constraints: 


n 
A= DV ng (4.y,2) +053 (3.52) 


fm} 


w=ni(t,y), j=1, 2,..., 2-1, (3.53) 


where n/ are phase coordinates, y!, v*are control functions. 

We will show that in the absence of any constraints on w, the initial 
problem (we designate it Problem 1) can be reduced, actually as well as 
formally, to the problem of minimizing the functional (3.43) on the set of 
elements (n(4), y(t), ¥(4), w(t)) satisfying equations (3.51), (3.52) and con- 
ditions (3.45), (3.46) (we designate it Problem 2), 

Consider a set D, of elements (n(t), y(t), v(t), w(t)) satisfying the 
following conditions: 

1. The vector function y(t) =(y!, 72, ..., n’7') is continuous and piecewise- 
differentiable on [f, i]; ni are the new phase coordinates. 

2. The vector function (y(t), u(¢)) is continuous everywhere on [to, 4] 
with the possible exception of a finite number of points, where it may have 
discontinuities of the first kind. 

3. For every ¢€ [to, ¢,] the vector (n, y, v) belongs to the set Vj(¢) of the 
(2n+r)-dimensional space defined by conditions (3.45), (3.48), and (3.46). 

4. The functions y(t), y(t), v(t) satisfy equations (3.51). The set of all 
control functions (y(t), v(t)) is designated D,. Problem 2 is formulated as 
follows. Find an element (n(¢),y(£), v(t))€ D: on which the functional 
(3.43) attains its minimum value on D,. For the purposes of Problem 2 it 
is assumed that the sought element (n(f), y(t), v(t)) is contained in D,. 

All that follows can be generalized without difficulty to the case when the 
minimum of / is not attained on D, and a minimizing sequence is to be 
constructed in D,. 

Let us consider the simultaneous equations (3.44), (3.52) for (y(t), v(t), 
w(t))€D. Then, by (3.53), there are only n independent equations among 
(3.44), (3.52), and the equations in (3.52) are independent of one another. 
We will now show that D,>2D. 
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Indeed, let D,’ by a subset of elements of D2 satisfying all the conditions 
of D, with the exception of equations (3.44). Then D.’/>D, since system 
(3.52) contains fewer independent constraints than system (3.52), (3.44), 
which is equivalent to system (3.44). Hence, Dis a subset of Do, D,>D. 
Then, in virtue of the particular structure of the functional (3.43), we have 
the obvious relations 

inf / = in inf / < inf /. (3.54) 

Suppose that Problem 2 has been solved, say, by further investigation 
of the function R; and the conditions (1.12). Let (f(t), a(t). @(t)) minimize 
the functional (3.43) on D,, where @(t) may be any piecewise- continuous 
function on [f, ¢,], since the functional and system (3.52) are independent 
of w. We will now show that a function #(t) exists such that there is an 
element (¥(¢), 2(t), @(t)) satisfying equations (3.44) which satisfies to any 
desired accuracy the boundary conditions (3.46) and the constraints (3.45), 
(3.46) and approximates* to the element (f(t), 3(1), @(t)) € Do. 

Indeed, if #(¢) is continuous and piecewise-differentiable, with a finite 
number of points of discontinuity of the derivative, then inserting f{(/) in 
one of the equations in (3.44) (such that the substitution does not give a 
trivial relation of the form 0= 0) and solving the resulting equation for w, 
we obtain a function w(t) which, in virtue of the properties of (tf) and 
g,h has at most a finite number of discontinuities of the first kind. In 
this case (f(t), (1), H(t)) C D and we may take 


(y(t), B(t), w(t) = (F(t), BL), B()). 


If #(t) has a finite number of discontinuities of the first kind, we proceed 
as follows. Suppose that relations (3.49) are written in the form 


y=X(0,5,4), x= 0007, (3.55) 


where x'(n, 6, £) are some constraints which are continuous and differen- 
tiable functions of their arguments, 6 is some scalar variable. 

We know from the theory of ordinary differential equations that such a 
representation indeed exists (see, e.g., /2/). The substitution of variables 
reduces (3.52) to the form 


= v(N,6,0,t), v=(viv2,...,v-1), (3.56) 


where v/ (n, 6, v, !) are some continuous functions of their arguments, and 
(3.44) is reduced to the form 


}=v(N,0,0,1); > (3.57) 


6=vy (9,9,0,2) + vz (n,0, 2) w. (3.58) 


© We say that a continuous vector f function x(¢) approximates on [fo, 1) with an accuracy & to some 
piecewise-continuous function x(t) with a finite number of discontinuities of the first kind if 
| x(t) —x(t) }J<e everywhere on [fo, ¢], with the possible exception of the e -neighborhoods of the dis- 
continuity points of x(?). 
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The functional (3.43) and the other relations figuring in the problem are 
also expressed in the new variables. In (3.57), (3.58), 6 is a control 
element, whereas in (3.56) it is a phase coordinate. Let 6(f) be the control 
function 6(¢) corresponding to the solution of Problem 2, In our case, 6(t) 
is some piecewise-continuous function with a finite number of discontinuities 
of the first kind (Figure 3.2). Evidently, 


y¥(N=x(n(2), 0(4, 2. 


We partition the segment [fo, ¢|] into subintervals A, so that the dis- 
continuities coincide with some of the partition points. Joining the points 
(ts, 8 (¢s)) by straight segments, we obtain a continuous polygonal line 
(¢, @s(t)) "inscribed" inside the curve (t, 6(f)). Here s=1, 2,..., S. 

Let us find the solution ys(/) of (3.56) for 6(¢) = Os(t), u(#)=d(t) and the 
initial conditions ys(t) =7(0) . First we will investigate the behavior of the 
solution ns(t). We see from the construction of 6s(¢) that for any arbitrarily 
small e>0, there exists 6 (for S—oo, max A;—-0) such that 


26 = (8()—Fg (| <e 


everywhere on [fy, 4], with the possible exception of the e -neighborhoods 
of the discontinuity points of 0(/). Because of the constraints, we can 
construct a closed bounded region G in the space of the variables n, 0, ¢ 
which contains all the admissible values of the variables corresponding to 
the solution of system (3.52) with the above initial conditions. Since the 
functions v/ (ny, 8, 4) are continuous, the increment 


bv=[v(n, 9(9, 9), t)—v(n, 85(0), 2, 4) 22 


is of the order e, i.e., Ov<ke for 60<e, where k is some constant. 
Near the discontinuity points of @(4), 6@and 6v are bounded, i.e., 
68</, dv<cm where J, m are constants. Hence we obtain an estimate for 


dn=|n(t)—ns (2) |: 


t 
n= J vO), D(A), t—vin, G5(0, O(0), dt < 
0 


fy N Th, os N 
<foar< (/ avdt + f ia) < vy [meets ee A i (3.60) 
0 1 i_as “n 1 
where tr, 2=1, 2, ..., N, are the discontinuity points of 6(t). 
8 a(t) 
ast) 
i) 
t 
FIGURE 3. 2 
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The same estimate evidently applies to 
él = | Ts—T| . 


Hence it follows that for S—oo(max 4s 0), liw—as (| >0 for all 
‘€[¢,,f,] (in particular, s(t) > n(4). 
Inserting 9s(#), s(t) in (3.55), we obtain 


ys(t)=x(ns(t), Os(2), 2). (3.61) 


The functions x(n, 8, ¢) being continuous, we obtain 


¥s(hi) 9 (4) 


for S—oo. 

Inserting 6s(¢), ns(t), 3(t) in (3.58) and solving the resulting expression 
for w, we obtain some piecewise-continuous function w(t) with a finite 
number of discontinuities of the first kind. 

The sequence (ys(t), 5(/), @s(t)) is a minimizing sequence, each of its 
elements satisfying equations (3.44) and, with an accuracy of ¢, the 
appropriate boundary conditions and constraints; in other words, it belongs 
to the set D with an accuracy of €. 

We thus arrive at the following general scheme for the solution of the 
problem. The starting system of equations is replaced by equations (3.52), 
(3.53), where the phase coordinates are the first integrals of (3.50) and the 
control functions are y(t), v(t). In virtue of the properties of the first 
integrals, this system is independent of the control function w(f). We thus 
define a set D2, of the elements (y(t), v(t)). We then minimize the functional 
(3.43) on the set De (in a certain sense, this is a simpler problem, because 
the new system of differential constraints has a lower order than the 
original system). The solution obtained in this way either belongs to D or 
can be approximated with any desired accuracy by a sequence in D which 
converges to this solution and is at the same time a minimizing sequence 
of the original problem. In our case, when the control function wis un- 
constrained, Problems 1 and 2 are in fact equivalent, 

In general, when V,-(t, y) is bounded, Problems 1 and 2 are not 
equivalent. The above scheme may be applied in this case also: the only 
additional step is to check that the function w(t) satisfies the constraints. 
If it does not, the sought solution will contain sections corresponding to 
the boundary values of w. Ourscheme is inapplicable in this case for a 
rigorous solution of the problem. However, the solution of Problem 2 
again proves quite useful: it helps to form a qualitative idea of what the 
sought solution should be (the discontinuity points of §(t) generally corres- 
pond to the boundary control w) and to obtain a lower-bound estimate of 
the sought solution (relation (3.54)), The conditions of the optimum 
principle (1.11), (1.12) are naturally valid in the general case also. The 
sought minimizing solution in this case is probably a combination of seg- 
ments corresponding to the interior points of the set V,, (singular sections) 
and segments corresponding to the boundary of V». On singular sections, 
the function 9 is naturally defined as a solution of the partial differential 
equation (3.48), and after that we proceed to investigate the function R). 
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Singular sections are described by equations (3.52), (3.53), On sections 

corresponding to the boundary w, g should be defined proceeding from 

the specific features of the particular problems being considered (see 3. 5. 4). 
Finally, we obtain equations which describe sections of different types 

and the function @ on these sections. The last step in this solution 

procedure is the appropriate matching of the different sections and a final 

verification of conditions (1.11), (1.12). 


3.5.3. A degenerate quadratic functional 


The above method will now be applied to minimize the functional 


ty 
i=) (2: (t)y'y! +6, (4) y'w)dt (3.62) 
with the constraints 
y=M(t)yt+Liijo, (3.63) 
y (fo) =y (ti) =0, (3.64) 


where 
y= (9, Peo): 
M(t) and L(t)=(L!, L?,..., L") are respectively an nXn matrix and an n - 
dimensional vector of the coefficients of linear system (3.63). Summation 
over repeating indices is implied, 
The problem should be reduced to the form (3.43), (3.44), and to this 
end we will treat it as a problem of minimizing y°(t,) for the system 


y? =a4ij(t)y'y/ +5,(¢) y'w, 
y=M(t)y+LOw, 


(3.65) 


with constraints (3.64) and an additional condition y(t) =0. 

In this case f(t, y, 0) =0; Fly, w)=yis g(t y, 0) = (aiy'y’, M(t)y), 
h=(b(t)y, L(t)). 

Let us find the independent first integrals of (3.65) 


dy0 
=) (3.66) 
d 

Salt). (3.67) 


They have the form 


1 
2(L")2 


w=y——— (ayn) y" — L(y"); (3.68) 


16 


Li P 
7s j=1,2,....2—1. (3.69) 
Equations (3.52) take the form 


fe 1 1 
bea a errs 5;(Mly) a em) Mry— 


—[em, (-~) = 7 (oyn))} |e oy ot | x 
x My y'—[r a (4) +4 (Sam oL)|u" (3.70) 
jv My ry | t ye" (3.71) 


where y!,7/ satisfy (3.68), (3.69); M/is the j-th row of the matrix M. 
Expressing y/ in terms of / and y" from (3.68), (3.69) and inserting in 
(3.70) and (3.71), we obtain 


A= galt) nen + Ben (AY + Ban OY), (3.72) 


nt= mi (tn +l (dy, (3.73) 
Ak, £=1,2,....2—1, 


where @ai(t), gan(t), Zan(t), m* (é), /*(t) are the coefficients obtained after 
the substitution and collection of similar terms; 1* (/)are the new phase 
coordinates; y” (f) are the new control functions. 

If gnn30, Problem 2 is non-degenerate and can be solved in finite form 
by the standard methods, e.g., by applying Jacobi's classical condition. 
The original problem is thus also solved. 

A problem of this kind arises in connection with the second variation of 
the functional if the extremal (the solution of the Lagrange—Euler equations) 
is degenerate, i.e., if Hy»=0 along the extremal. 


3.5.4, An example with constraints on w 


Minimize the functional 


t, 
r= J (y'—(yp)dt (3.74) 

6 

with the constraints 
y= yw; (3.75) 
y=“ — a; (3.76) 
yl 

y'>0; |w| <A; A>D0; (3.77) 
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YO=y¥(0)=0, YH=y¥, ¥(t)=y, (3.78) 


where f?, is free. 
To solve the problem, we find the first integral of the system 


dyl _ dy? _ 
ie Oe ge (3.79) 
The first integral is given by 
n=y'+ = (y. (3.80) 


Changing over to new variables n, y', w in (3.74), (3.75), (3.76), we 
express (y?)2 interms of y and y! from (3.80): 


un 
1 =I (3y!—2n) at; (3.81) 
Q 
7a (3.82) 
Y=V2n-y)o. (3.83) 


Since the right-hand sides of (3.81) — (3.83) are independent of ¢ and 
is free, we can conveniently change over to a new argument 7. By (3.77), 
720 » and this substitution is legitimate. The problem is thus reduced to 
the equivalent problem of minimizing the functional 


un MN 


3y1— 2 
n=) fyindn=\ y! 3 = 79 ay (3.84) 
ny 
No No 
with the constraints 
1 
(y') =f (y,0,w) = —H— (3.85) 
V 2(q~—y)) 
1 2 
M=9; N= yi +> (vil (3.86) 
¥(M)=0; y(N)=y}; O<y' <7. (3.87) 


To solve the problem, we investigate the maximum of the function 


R=o ae a be: (3.88) 

R attains a maximum with respect to w under the following conditions: 
ey =0, any w (3.89) 

gy > 0, W=A; (3.90) 

gy <0, w= —A. (3.91) 
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In the first case, R takes the form 
1 
Ray aA SE . (3.92) 
n—yl 
The dependence of R on y! for some fixed y is shown in Figure 3.3. 
It is readily checked that the maximum R for every fixed y is attained 


for y}(n) = (i— 4)n (see Figure 3.3). The solution of Problem 2 thus has 
the form 

yi=(1—-V1B)1 (O<n<n); 

pes H (1=M,). (3.93) 


It is readily seen that this solution gives an absolute minimum of the 
functional (3.84), if the differential constraint is ignored. 


RFU Yq yy 


FIGURE 3.3 


Inserting (3.93) in (3.85), we see that w=co for 7=0, n=n1, So that the 
constraints imposed on w break down. Therefore, on the initial and the 
final sections, wis boundary control. Using this information, 
we construct a trail minimizing solution, which consists of the following 
three sections (Figure 3.4): 

I. O<n<npe, (¥(n))1 is the solution of equation (3.85) for w=A and the 
boundary condition y!(no) =0. 
I. <<, A ag ae 

Il. 43<1<,, (y(n) is the solution of equation (3.85) for w=A and 
the boundary condition y'(n)=y}. 

The function p(n, y'!) is defined on I, II in the form 


9(1, y= (n) y!+0(n)(y!— y(n), (3.94) 


where p(n), o(n) are some continuous and piecewise-differentiable 
functions of 7. 

The sufficient conditions of a local maximum of R on I and III have the 
form 


Oy: >0; (3.95) 


Py=Ry (n, y', A)=W+Hy =O, (3.26) 
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FIGURE 3.4 


Py = Ryy (My! A) = 0’ +29 a) A+ Hyy <0, (3.97) 


where 
H=o¢f (y'n,A)— f° (y',n), 


P(n!)= sap hate): (3.98) 


At the points n2, nz we should have 
N=Ny (9: =0, (2) =0; (3.99) 


N=N5, (Yi =0, (9) = 0. (3.100) 


Otherwise, the function (yn, y') will be discontinuous in y! at these 
points (on section IJ, gy: =0). 

It is readily seen that the solution of equation (3.96) on III with boundary 
conditions (3.100) is positive. 

Indeed, consider the general solution of (3.96) 


un up 
gae Ff e(nyeFdy; F (n= A(n)dy, 


no 


where A(n) is a coefficient before p in (3.96); g(n)=ff,. 

On this section Foy: >0 (see Figure 3.3). Hence p(n)>0 for all 
n€ (ns, m1). 

A similar situation is observed on section I. The only difference is that 
the boundary condition here is defined for the right end point and f}, <0, 
since y'<y,. Here again p(n) >0 everywhere on [mo, ne]. Since g, is 
continuous, we conclude that there exists a neighborhood of the curve 
y(n) where 
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9,20. (3,101) 


Consider condition (3.97). This inequality may be replaced by an equation 
with a positive right-hand side 


+s fpwA+ Hyyr=e(N), (3.102) 


where e(n)>0 is some piecewise-continuous function of y. 

Equation (3.102) with boundary conditions (3.100) is a priori solvable 
on I and III (a Cauchy problem for a linear differential equation). Hence it 
follows that there exists a function (ny, y') such that the solution 
constructed on I, III corresponds at least to a local maximum of P(y', n), 
and the solution on II corresponds to an absolute maximum of R(n, y!, v). 
We thus see that the solution constructed above ensures at least a strong 
local minimum of the functional in our problem. 

In this example, Problems 1 and 2 are not equivalent because of 
constraints (3.77) imposed on the (ontrol function w. Asa result, we did 
not change over to Problem 2 at the very beginning. However, we did 
change over to new phase coordinates (y!, n) and the solution of the problem 
was thus considerably simplified. 


Remarks 


1. Problem 2 (the function R, ) may turnout to have the same singularities 
as the original problem (the function R). The method of multiple maxima 
then can be applied repeatedly to analyze Problem 2. 

2. A practical shortcoming of the above method is that it requires 
determination of the first integrals of some system of ordinary differential 
equations (3.50). In applied problems and, in particular, in most problems 
of flight dynamics, equations (3.50) are sufficiently simple and their first 
integrals are obtained without difficulty. 

3. The above method can be generalized without any changes to the 
case when the vector function h depends on the control function v, but so 
that for fixed ¢ and y the vector A maintains a constant direction when v 
is varied. In other words, the unit vector function A(t, y, v)/|A(t, y, v) | is 
independent of v, and no constraints are imposed on w, 


We write A(t, y, u)w inthe form h,(¢, y)w;, where h,= = w,;=|A|w, and 
(ab 


w,is used as the new control function. The problem is thus reduced to the 
one considered above. 


3.5.5. Systems with several unconstrained control functions 


Let the functional (1.1) and system (1.3) have the form 


t 
p=) F°(é,y) at + F (Yo, 41): (3.103) 
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k 
y=ety+> t(ty)e', (3,104) 
l=1 


where uw‘, [=], 2,...,&, are the components of the control vector u. The 
set V, coincides with the space U. The vector functions fA; are continuous 
and differentiable for every t€ (, t1),y€ Vy, and the functions g, f° are 
moreover bounded for bounded t,y,u. 

Although the problem can be solved by the method of the previous sub- 
section, using several recursive reductions to Problem 2, we will describe 
a different approach which a priori enables us to indicate the conditions to 
be satisfied by the vector functions h,(t, y) so that the function 


R=%, (con+ hitam'|— eshte (3.105) 


for every 1t€ (to, ti) attains a maximum on 
y(t) € Vyft), a(L)EV,, [a] soo. (3.105a) 


For simplicity, we assume V,(t) to be an open bounded region. We write 
R in the form 


R=A(t, y)ut+B(t, y), (3.106) 
where A(t, y) is a vector with components g,f(f, y), /=1, 2, ..., k, 
Bit, y) =gye(t, vy) +er—P(t, 9). 


First we prove the following lemma. 
Lemma. For the function (3.106) to attain a maximum on (3.105a) for a 
fixed ¢, it is necessary and sufficient that 


1) A(t,y)=0, (3.107) 
2) BAGS ee BAGd): (3.108) 


Proof. Necessity. We write the vector u inthe form 
u=ov, 
where v=|u]; Q is a unit vector je]* 


The inequality 
R(t, y, u)—R(t, ¥(t), u(t)) <0 


is satisfied for all yC€ V,(t), u CU if and only if 


a R—R (typi) = max {A (4,y)ov+8 (¢,y)} ay 
—R(ty,4)=1A(ty)| v+ By) —R (t,y,4) <0 (3,109) 


for all yEV,(t), v €[0, +o]. 
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The maximum with respect to @ is accounted for as follows. The 
scalar product Ag is the projection of the vector A on the direction of @. 
It attains a maximum, equal to the magnitude of A, if @ points along the 
vector A. 

Suppose that (3,107) is not satisfied and A(t, y*(t)) 40 for some y=y*(t) € 
€ V,(t). Then, since the first term in (3,109) is positive and the terms 
Bit, y*(t)) and R(t, 7, @) are finite, we can find such v that inequality 
(3.109) is not true. 

This proves the validity of (3.107) and hence of (3.108), since from (3.107) 
we have 


R(t, y, u) = Ble y). 


The sufficiency of (3.107) and (3.108) is self-evident. Q. E. D. 
Condition (3.107) is a system of linear homogeneous partial differential 
equations for the function @: 


Liw@=> pyt Al (ty) =0. (3.110) 
i=l 


In general, such systems are incompatible. A necessary condition of 
their compatibility is that the so-called Poisson brackets for the functions 
Li(t, y, @y) vanish identically (see, e.g., /9/, p. 365): 


y, Off gy Onl 
(LrLm=J) ive ce aye (3.111) 


j=l \iH1 ial 


Conditions (3.111) constitute a new system of linear homogeneous partial 
differential equations for g. Seeing that 


(Li Lm) ad (Lm, Li), 


we conclude that the number of these equations is RAS I) 

These equations should be added to (3.110), eliminating from the combined 
set identities and equations which are linear combinations of other equations. 

The subsequent stages of the procedure are described in /9/, and no 
details are given here. 

Conditions (3.111) will be used in the next subsection. 


3.5.6. Minimum of the second variation of the functional 
in case of degenerate control 


Consider the system of differential equations (3.11), (3.12) which describe 


the case of degenerate control of degeneracy &. The integrand in the 
functional (1.1) may be written in the form /2/ 
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Fo=y? (t) f9(t,y,Ua), (3.112) 
P=1,...,.44+1. 
Equations (3.11) may be treated as an ordinary system of differential 


equations with linear control funttions y*. Solving (3.12) for y!, say, and 
inserting the result in (3.11), we find 


y=F=f (ty mdr FS (tyea)—F (yt) (3.113) 
FO= f(t, yt) -+ v8 (F(t, y,te) — f(f,ystt1)), (3.114) 
P=2, 3,.... A441, 


where y* are now independent and satisfy only the following constraints: 


yv>o; Dy<, (3.115) 


Suppose that a degenerate control is observed on some section, 
satisfying (3.27) with the zero closeness function y(t) =y(¢) and the basis 
control functions 


u=e (t), B=2, 3,..., R41, 
ye=y (1), B=2, 3,..., &+1, 


where y (tf) satisfy the strict inequalities in (3.115): 


+1 
¥'()>0; SPO<L (3.115a) 


Ba 2 


Adding small increments dy’ to the functions y*(t) such that (3.115a) are 
still satisfied for the incremented functions (y + 6y*) with unaltered 
ii, (t), dp (t), B=2, 3, ..., R+1, and fixed boundary conditions y(t;)=y(u), 
we obtain from equations (3.113), (3.114) the corresponding increments of 
the zero closeness function, the integrand function, and hence the functional 
relative to the values on( ¥(f), y*’ (t), @(1), wa (t)). For sufficiently small 
increments dy°, we can retain only the linear components of the increments 
y(t)—y(t), which are described by a linear system of variational equations 


2! =F, (1) 2' + Fy (A) v", (3.116) 
where 
e=y'—y'; Yay-¥, 
and investigate the sum of the first and the second variation of the functional 
to within terms of higher order. 
It is readily seen that the first variation of the functions is zero by (3.27) 


and it thus remains to check that the second variation is non-negative, this 
being the necessary condition for the minimum of the functional, 
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Let # be the Hamiltonian of (3.113), (3.114): 


H=)~F —P=H(t, 9, y, )— (H(t, ys, Us)-AE 4% a) (3.117) 


where 


A(t, Yy, %, uy=pf (t, Y, ay— fre, Y, nu) 


is the Hamiltonian of (1.3). The second variation of the functional is then 
written in the form (see, e.g., /11/) 


21 = — { (Bytys (O12! 2! + yt aen2'r?) ats (3,118) 
a eee oe ee eee 


The superior bar in (3.116) and (3,118) identifies the first derivatives of 
the vector function F and the function 4. Summation over repeating indices 
is implied. 

The problem reduces to investigating the minimum of the functional 
(3.118) under conditions (3.116) and the boundary conditions 


2(T,)=2(T)= 0. 


No restrictions are imposed on the variables z!, v8. 

This problem thus corresponds to the case considered in the previous 
subsection. To finally reduce this problem to the form (3.103), (3.104), we 
will formulate it as Mayer's problem, i.e., the problem of minimizing the 
final value of z for zero initial condition, supplementing (3.116) with the 
equation 


=H yy) (1) 2'2/ + Byt yp (t)e'v*. (3.119) 
Equations (3.110) now take the form 
La (9) == P20 Hey! zB (t) 2’ + 921 F 48 (t)=0. (3.120) 
Here 
0 q rf i Qa 
he= H,a027; y= Fe Q=s hy. 


Note that only 43 depends on the phase coordinates; the remaining hj 
are independent of z, so that their derivatives with respect to z? in the 
Poisson brackets all vanish. 

Inserting the expressions for h°, hi in (3.111), we find 


(Lp, Le= (FPA, [it —F 


These combinations should vanish identically in virtue of (3.111). But 
gz. ~0. Otherwise, condition (1.12) of Theorem 1.1 is not satisfied. We 
should therefore have 


tase “>a 
uy Ay uy A, 


P) see (3.121) 


P= 0, (3.122) 


tas 
uy H 


7A | -7 


y 
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Both terms on the right are scalar products of the vectors f and Hy. 
Now turning to equations (3.29) and (3.12), we express y! from (3.12) in 


terms of the other y: 
k+1 


v=1—> 9", 


p=2 


Inserting y! in (3.29) and collecting similar terms, we obtain 


Ae = — A, — [ad to (a) — Ay (un) 
a, P=2, 3,..., R41, 


and the coefficients of these equations coincide with the left-hand sides of 
(3.122) and should therefore vanish. The number of these equalities is“, 


and, as we saw in § 3.2, in virtue of (3.123) they lead to & further equalities 
4a ua Ug 
Ala, +[HJe f (u)) — Hy (am) fe =0. (3.124) 


Equalities (3.122) were obtained as the necessary conditions of a 
maximum of R and a minimum of © when minimizing the second variation in 
degenerate control problems. They were originally derived in /1/ as the 
necessary conditions for optimal degenerate control. 

Note that once conditions (3.122) have been verified, the problem of 
minimizing the second variation can be solved to completion by successively 
changing over to Problem 2. 


ale (3.123) 


say |4e 48 
v [Ales _ A, 


3.5.7. A more general problem 


Consider the same problem as in 3,5.2, using differential equations of a 
slightly more general form: 


y=glt. y, v) +h(t y, w), (3.125) 
where g(t, y,v) has the same properties as in 3.5.2, and A(¢, y, w) and the 


boundaries w,(t, y) and w2(t, y) from (3.45) are such that the functions 
h(t, y, w(t, y)) and A(t, y, we(t, y)) are continuous and continuously differen- 


tiable with respect to ¢ and y for all ¢ E(t, #1), yCVy(t). 
The function & for this problem has the form 


R(t, y, 0, W) =Py8 (ty, ¥) +oyh(E y, W)—P (ty, ¥) +O. (3.126) 
We choose 9(t, y) so that for all ¢€ (¢, #1), yC Vy(t) the function R 
attains its absolute maximum on two values of w, w(t, y)and we(f, y), 


Rit, 9, ¥%, Wi(4 Y)) =R(t, Y, , We(t, y)) =sup R(t, y, v, w). (3.127) 
wEVey (ty) 


We see from (3.127) that only the term 


Py A(t, y, w) 
depends on w. 
Condition (3.127) reduces to a certain combination of inequality 


constraints 
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Py C Q(t, y), (3.128) 
where Q(t, y) is a set in the space of vectors g,, and the condition 


gy (ACE, Y; w(t, y))—ACE, yY, Bo(t, y))=9, (3.129) 


which is a linear partial differential equation for the function @. Its general 
solution is an arbitrary continuous and differentiable function of the form 


e=9(t, n), 
where n(t, y)=(n', n2,..., n%') is the set of independent first integrals of 
the system of ordinary differential equations 


h(t, y, w(t M— h(t H Walt W) (3.130) 


with ¢ treated as a parameter. 
Let A,;=supR. Then 
weVva 


Ri=F1y Dwe' (4, y, UN. 
T 


We will further investigate the maximum of the function R, for every 
t € (to, f;) on the set of points ( n, y, v) satisfying conditions (3.45), (3.46) and 
the additional conditions 


n=n(t y). (3.131) 


It is further assumed that the function g(t, y) satisfies (3,128). ne 

Suppose that such a function g(é, y) exists and a solution (y(t), y(t), o(t)) 
has been found which under the given conditions ensures an absolute maxi- 
mum of the function R, for every ¢ € [t, 41]; moreover, conditions (1.12) 
and (3.128) are also satisfied. Suppose that this solution satisfies the 
following system of differential equations: 


y=vi(g lh Y; v)—Alt, y, @1))+ vo(g (t, yY, v)+ . 
+h(e, y, W))=g (8, y, v)+vy, h(E, y, wW,)+vwA(E, y, Wo), (3.132) 
where v,(f), ve(¢t) are some piecewise-continuous functions, 
vi (t) +ve(t) =1. 


If substituting the solution (f(4),%(¢)) in this system we find that v, ve 
satisfy the conditions 


v1,2(t) >0, (3.133) 


then (¢, 7) © w and the solution constitutes a zero closeness curve of a 
sliding control with the basis control functions wj(t, y), we(t, y) and the basis 
vectors 
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a,=(1, Fa ¥, 2, w,)); (3.134) 
a,=(1, Fé, YY; v, W.)). 

Thus, if all the above conditions are satisfied, our solution defines the 

sought minimizing sequence and the problem is solved. If no function 
g(t, y) exists and no solution (y(t), y(t), u(t)) satisfying all the above con- 

ditions, we conclude that the sought optimal control may consist of sections 
of different types — boundary sections, Euler sections, sliding control 
sections. On sliding control sections, the function (4 y) may be defined 
by the method that we described. 

As ina problem with linear control functions w, we can change over to 
Problem 2 as follows. 

The initial equations (3.125) are supplemented with the expressions for the 
the total derivatives of y(t, y) with respect to f: 


A= Wy (ei (4, yy VATE, y, W)TM. (3.135) 
1 


Equations (3.125) and (3.135) considered jointly constitute a system of 
(2n—1) equations where, in virtue of (1.131), only n equations are independent 
and which is obviously equivalent to the initial system (3.125). 

We further consider the problem of minimizing the functional (3.43) on 
the set L of the elements (y(t), y(t), u(t), w(t)) which only satisfy equations 
(3.135) and (3.131), where 7! (¢) are new phase coordinates and y, v, w are 
new control functions. 

The remaining conditions (3.45) —(3.46) do not change. The set of new 
control functions (y(t), u(t), w(t)) is designated D,. Repeating the same 
arguments as in 3.5.2, we readily see that D, DD and 

inf /=inf/<inf/. 
D, Di D 

Suppose that Problem 2 has been solved, The corresponding optimal 
control #(t) may take on values corresponding either to one of the boundaries 
@, @ of Vy or to its interior points. In the former case, both w, and 
W2 are optimal values, #=w),2. Indeed, the functions 1 (¢, y) are 
solutions of the partial differential equations (3.129) (a property of the first 
integrals of (3.130)). Hence it follows that the right-hand sides of (3.135) are 
not affected when w, is replaced with we and vice versa, so that n remains 
unchanged. The degeneracy in this case is k=1 and the functions 
G(2), 3(t), B(t) should only satisfy equations (3.135) and condition (3.45); 
7(¢), 5(4), @(t) can be approximated to any accuracy by constructing an 
appropriate sequence in D (sliding control). The solution of Problem 2 
can be constructed in D under certain boundary conditions. In the second 
case, it is not the properties of y(t, y) that cause non-uniqueness of @%(f). 
Asarule, @(t) is unique in this case and, when substituted with #(¢) in 
the original system (3.125), it defines under the appropriate initial con- 
ditions a unique solution y*(¢), which does not necessarily coincide with 
the (#) derived from other conditions. In this case, the solution of 
Problem 2 in general cannot be constructed in D. 
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Chapter IV 


SOME PROBLEMS OF POWERED 
FLIGHT OPTIMIZATION 


§ 4.1. VERTICAL ASCENT OF A ROCKET IN 
VACUUM 


The present section deals with a relatively simple problem of rocket 
dynamics, namely that of reaching extreme (maximum) altitude in one- 
dimensional vertical motion in vacuum (airless space). Constant 
gravitational acceleration is assumed. This problem may be treated as a 
simplified model of the general optimization problem of rocket maneuvers 
in a homogeneous field, which is solved in the next section. The method of 
solution and the specific properties of the solution of the general problem 
are conveniently illustrated in this simple case. 

The equations of the vertical motion of a rocket with controlled thrust 
may be written in the form 


1 
h'=~——V; (4.1) 
; p 
(Se AN a en 
Dei aS (4.2) 
) 
ie mt 4.3 
roa (4,3) 


where h is the altitude reckoned from the surface of the planet, Vis the 
velocity, tistime, misthe mass, cis the nozzle velocity, g is the 
gravitational acceleration (assumed constant), Bis the per-second rate 
of fuel consumption (the control element), O<P<Bmax. The independent 
variable in these equations is the current mass (a non-increasing function 
of time). 

The following boundary conditions are used: 


ho=0, Vo=0, fo=0 (4.4) 


for m=mp (the rocket starts from rest); the values of v, and ¢, for m=m, 


are not given. 
Our problem is to find the motion reaching the maximum altitude at 
the end of the powered flight (i.e., minimize (—A,)). 
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The solution of the problem is divided into two stages: first we solve 
the problem for any fixed ft; such that 


(ti—lo) > (to-—)) 1/Bmax, 


and then carry out optimization with respect to ¢,;, which yields the final 
solution. 
The functions R and @® of our problem are written in the form 


Sof ele gee EN ht : 
R=a(— TV) to (—St+ 48) % 5 ten (4.5) 
B= —h+9(m, My, Vi, 4). (4.6) 


Note that a may be treated as a linear control function and equations 


(4.1) —(4.3) have the same form as the equations treated in 3.5. 2. 
We will use the method of multiple maxima. Solving the partial 
differential equation 


—oV + 9g —% =, (4.7) 


we obtain the first integrals § yn of the characteristic system 


Se. eg, (4.8) 
at dt 
namely 
t=V +21, n=h—Vt—— gt. (4.9) 


The general solution of (4.7) is an arbitrary continuous differentiable 
function 9) (& n, ™). Inserting this function in (4.5), (4.6) and changing 
over to new variables § and n, we find 


Ry = (91 — Fgh) (=o) + mi (4.9a) 
= —n—e= (edt ber(my, by M). (4.10) 
The function @, is given in the form 
P= HEE + Pay, (4.11) 
where pe), Pn) are some constants. Then 
Pim =9; PEe=Heys Pig=Po, (4.12) 
and (4.10) takes the form 
®,= —1,— b+ 408 +4 +> af. (4.13) 
Taking %)=4, Y)=1, we find 
®, => gti=const = min ®, (4.14) 
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for any &, m1. The function R; takes the final form 
R= —(h—-)— ‘ (4.15) 


Let us investigate the minimum of this function* for any fixed mE€ (mm, mo). 
The minimum is clearly obtained at the lower limit value of f¢, i.e., t,(m). 

The extreme values of ¢ over the interval (tm, m,), i.e., t,(m) and 
ty(m), are obtained from equation (4.3) with the appropriate boundary 
conditions and the constraints on 6. The lower limit (f,(m) is the solution 
of equation (4.3) for B=fmax which passes through the point (smo, to), and 
ty(m) is the same solution passing through the point (mm, 4) (Figure 4.1). 

Indeed, no solution of (4.3) passing through (mo, fo) may lie below 4,(m) 
for any m<mp, and no solution passing through (m,, t) may lie above ty(m) 
for any m>m,, since this would necessitate B>Bmax. The two straight 
lines t.(m) and ty(m) are parallel. 

This concludes the first stage of the solution. The resulting motion 
starts from the initial point along the lower limit ¢=fy(m) and then 
abruptly "jumps" at m=m, from {¢,(m,) to #,. In other words, maximum 
thrust ®p=fmas is maintained until all the fuel has burnt out, followed by a 
coasting stage until ¢). 


--be 
My Mom 


FIGURE 4,1 


Here t(m) is a discontinuous function. We can restrain the solution to 
remain in D by replacing the vertical segment issuing from the point 1¢, (see 
Figure 4.1) with an inclined segment extending to intersection with f,(m) 
and making the slope factor of this segment go to infinity k—-co. This is 
achieved by constructing a control sequence with B—O over a finite length 
of time. This sequence satisfies all the conditions of Theorem 1.1. 

Indeed, condition (1.12) is satisfied in virtue of (4.14) and condition(1.11) is 
similarly satisfied, since on this sequence the function R defined by (4.15) 
is minimum everywhere, except a set of points m whose measure goes 
to zero. 

To find the optimum value of ¢, (the solution of the second stage of the 
problem), it suffices to integrate (4.1) —(4.3) in order to obtain the 
dependence of fA; on ¢, and then find the maximum of #,. It is readily seen 


* The condition of maximum R (Theorem 1.1) is replaced, as is readily seen, by a condition of minimum 
when a decreasing argument is substituted for an increasing argument. 
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that the sought ¢; corresponds to V,;=0, and the maximum value of h, is 
given by 


fy=a_(c In TY — 70 cin M2 + -& (my —m). (4.16) 
“& my Binax m, — Bmax 

The reader will be able to apply the same method to solve the problem 
of soft vertical landing with minimum velocity on the surface of a planet 
without an atmosphere, for given initial conditions. 


§ 4.2. OPTIMUM MOTION CONTROL OF ROCKETS IN 
A HOMOGENEOUS GRAVITATIONAL FIELD IN 
VACUUM 


Various problems of rocket dynamics are concerned with maneuvers 
which must be performed within a relatively small region of space and in 
a comparatively short time, so that the real gravitational field acting on 
the vehicle may be treated as constant (the homogeneous field approximation: 
the real field depends in general on the position of the spacecraft and the 
time). 

For maneuvers performed near the ground within a sphere of some 
100 km radius, the relative error of this approximation is about 1.5%, 

As the rocket moves farther from the primary center of attraction, the 
error decreases (other conditions being constant), so that a midcourse 
correction on a lunar or an interplanetary trajectory may be treated as 
taking place in a homogeneous field. This significantly simplifies the 
analysis and leads to effective solution of numerous optimum problems of 
rockets dynamics /8, 9/. 

The general structure of optimum motion control in a homogeneous 
field has been investigated by a number of authors /2, 3, 11/. 


4.2.1. Statement of the problem 


The motion of the center of mass of a rocket powered by a thrust vector 
in vacuum in a homogeneous gravitational field is described by the following 
set of equations: 


re V; (4,17) 
c 1 
te ge ES ot rs 4.18 
V age (4.18) 
fe et (4.19) 
8 


where r=(r', r?, r3), V=(V!, V2, V4) are, respectively, the radius vector 
and the velocity vector of a point in the inertial frame of reference; 
g=(g', g’, g’) is the constant gravitational acceleration vector; pis the unit 
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thrust vector (a control element); fis time; mis the mass (independent 
variable); fis the rate of mass consumption (another control element); 
c is the nozzle velocity. In addition to (4.17) — (4.19), the admissible 
motions also satisfy the constraints 


O<P<f,,., Where 8,,,<00 (4,20) 


and the boundary conditions 
(ro, Vo) E B (19) (4.21) 


for fy=%y¢ and m=sm, and 
(7. Vi)E Bm) (4.22) 
for 4;=t,and m=m,. 


Here B(m) is a set in the vector space (r, V). The set of control 
functions 


z(m) =(r(m), V(m), t(m), p(m), B(m)) 
satisfying the above conditions will be designated D. Our problem is to 
find a control function Z(m) in D on which the functional — some function 
F(r,, V;) — attains its minimum. 


If the sought control function Z(m) does not exist in D, we will have to 
construct a minimizing sequence 7Zs(m) in D. 


4.2.2. Transition to Problem 2 


We construct the functions 
R(r, Vy p, B m=e (-+)v i a(—— p—s8)+ ° (-+ 42 (4.23) 
’ ? vr é m 3 vt fl . m: 


Miro. Vu. Ts Vi= Fir, Vi) +e, Vi, fh, )— ¢(To, Vo, 40, 1M). (4.24) 


The first two terms on the left in (4.23) are scalar products of three- 


dimensional vectors. 
The function ¢ is defined as the general solution of the partial differential 


equation 
eV + Sg + =, (4,25) 


where the left-hand side is the coefficient before ; in expression (4.23) for 
the function R. The solution of this equation has the form 


P= (6, 1, m), (4.26) 
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where £=(E!, £2, £5), n=(n!, 2, n3)) are the vector first integrals of the 
characteristic system of equation (4.25): 


av 
=e (4.28) 


It is readily seen that the characteristic equations in this case describe 
the coasting of a point in a homogeneous field. 
The first integrals &, n are given by 


f= V — gt; (4.29) 


n=r—Vit— ge, (4.30) 


Inserting the solution (4.26) in expressions (4.23), (4.24) for R and ® 
and changing over to new variables, we obtain 


ae R= —— [9-948] P+ Pm (4.31) 


®,=Fi(h. 1, A+, 4, 7%1)—91 (6, No, Mo). (4.32) 


The functions R, and ®, require further investigation. In accordance 
with the theory of 3.3.2, this transformation of the functions R and M may 
be interpreted as a transition to Problem 2, which calls for minimization 
of the same functional over the set D, characterized by the same system of 
boundary conditions and constraints and a new system of differential 
relations: 


v= p, (4.33) 
m 
7 = — pf, (4.34) 


where the components of the vectors & 9 play the role of phase coordinates, 
and the components of p and the time ¢ are the control elements. If 
equations (4.33), (4.34) are supplemented with equation (4.19), the resulting 
system will be equivalent to equations (4.17)—(4.19). Suitable transfor- 
mation of coordinates will move one system into the other. The expressions 
for the functional and the boundary conditions change accordingly under this 
transformation. Since (4.33), (4.34) contain fewer constraints than 

(4.17) —(4.19), the set D, is wider than D, i.e., D,;2D, so that 


inf /< inf /. 
Z < nf (4.35) 
We will show in what follows that the equality sign applies in (4.35), i.e., 
the initial problem may be replaced by Problem 2. 


Note that (4.33), (4.34) have a peculiar characteristic: the right-hand 
sides of these equations are independent of the phase coordinates. 
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The necessary conditions for a minimum of R are the following: 


Ru(—, 0, p, £ m)=0> ve) =0; (4.36) 

Rin (&, 1, D, t, m)= 0 Hq) =0; (4.37) 

R,(E, 1, p, i, m)=ink R(E, 7, p, #, m), (4.38) 
PtElEL tu] 


where 
dey =9e(m, Fn), 
$n =, (m, § 1). 


Here £(m), a (m) are the values of £ n along the sought optimal 
solution. 

Relations (4.36) —(4.38) correspond to the adjoint system and the condition 
of minimum of H (a function with a decreasing argument) in Pontryagin's 
maximum principle. These equalities thus give the necessary conditions 
of optimality. By (4.36), (4.37), &%, are constant (since the right-hand 
sides of (4.33), (4.34) are independent of the phase coordinates). 

The function H has the form 


H=—~ p(}a—n!). (4.39) 


The minimum of H is evidently attained for 


— 1 or Yeo! (4.40) 
p= t—2 a 
| ¥ ay Heap AI 
and it is equal to 
Fi c 
y= DE — Fy te —bnl= 


aa a V (Ye)? = Be Vené + (Yom)?! (4.41) 


FIGURE 4,2 
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The minimum of HA, with respect to ¢ coincides with the maximum of the 
radicand in (4.41), which is a quadratic trinomial in ¢. If p, #0, the 
coefficient before /? is positive, so that the radicand in (4.41) can attain 
a maximum only for extreme values of /, i.e., ¢i(m), ty(m) (Figure 4.2). 

Comparison of the values of H, at the lower and the upper limits yield 
the following conditions for optimum selection with respect to ¢: 


t(m) for Soro + _tulm) + tym) 


(Ya? 2 
= ty(m) for Vator & tlm) + tym) (4.42) 
(Yeu)? 2 


Since the left-hand side of the inequality in (4.42) is constant and the right-hand 
side is a monotonically decreasing function of mass, the motion from mp 
to m, (in the direction of decreasing mass) can have at most one control 
change, necessarily from the lower limit ‘.(m) to the upper ty(m). 

In particular cases, depending on the values of pe) 4,,), control functions 
without abrupt change are possible: either t=f.(m) or t=fy(m). 

For ny=0, the function H; is independent of ¢ andthe condition of 
minimum of H, with respectto ¢ is satisfied by any piecewise-continuous func- 
tion {(m) varying between the fixed limits. This case is observed when either 
the initial or the final value of the vector is free. 

If the function F; is concave and the boundary conditions are fixed or 
defined by linear forms in terms of the initial and the final values of the 
coordinates (the components of the vectors &, No, &, 1 ), the necessary 
conditions (4.36) —(4.38) are also sufficient, since the right-hand sides of 
(4.33), (4.34) are independent of the phase coordinates. Therefore each 
solution satisfying the above conditions is of necessity optimal. In par- 
ticular, if the limit value yn, or no of the vector n is free, the optimal 
solutions are inherently not unique. Any combination of the control 
functions 


where Pit) is selected so that the boundary conditions are satisfied and 
t(m) is any function falling within the admissible limits, is an optimal 
solution of Problem 2. This can be verified directly, by eliminating 
equation (4.34) from system (4.33), (4.34) and considering the problem for 
equation (4.33) only, whose right-hand side is independent of ¢. 

In the general case of nonlinear boundary conditions and nonlinear 
functionals, conditions (4.36)—(4.38) are only necessary. To find the 
optimal solution in this case, a further analysis is needed (e.g., further 
investigation of R). However, since conditions (4.36), (4.38) are 
necesSary, they describe the structure of the optimal solution. 
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4.2.3. The solution of Problem 2 in the original 
set. The structure of optimal control functions 


The solution of Problem 2 belongs to the set Dij> D. If this solution in 
the (f, m) plane satisfies equation (4.19) with the given constraints on 6, it 
also belongs to the original set D and is thus a solution of the original 
problem. It is readily seen that solutions of the form shown in Figure 4.2 
(the case ») #0) satisfy equation (4.19). The sections of the upper and 
the lower boundaries correspond to B=Bmax (motion with maximum thrust), 
and the jump at the switching point corresponds to B=0 (coasting with the 
thrust off). The direction of the thrust vector along the powered sections 
is determined by equation (4.40). If we strictly confine the solutions to the 
class D, which corresponds to continuous functions t(m), the solution of 
the problem is provided by a sequence which in the (¢, m) plane (Figure 4.2) 
corresponds to a sequence of the functions {(m) obtained when the sudden 
jumps at the points mo, m,, m, are replaced by inclined straight sections 
with the slope factor going to infinity (as in § 4.1). 

The case  (,,=0 requires special consideration. The solution of 
equations (4.36) —(4.38) in this case incorporates all the piecewise- 
continuous functions f(m) satisfying the constraints t,(m)<t(m)<ty(m) on 
[m1, %] and the boundary conditions fy, ¢t;. These functions may include those 
which do not satisfy equation (4.19). If the conditions are nonlinear at the 
two end points, further analysis is required in order to find the optimal 
solution of Problem 2. There is danger, however, of ending up with an 
optimal solution which corresponds to a function ¢(m) that does not satisfy 
equation (4.19). 

In order to assess the imminence of this danger, let us consider the 
solution of system (4.33), (4.34) for w_)=0. By (4.40) we have 


= saan, (4.43) 
Hel 
Hence 
§,=ty+ pinZ, (4.44) 
my 
= M+ pf ‘Gors, dm. (4.45) 


We see from (4.44) and (4.45) that the vectors (£,—&) and (n:—no) are 
collinear with the vector p and the ends of the vectors (ni— yo) define a 
segment in the direction p with the end points 


aa (4.46) 
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The sought optimal value of (ni—no) must lie inside this segment. If it 
coincides with one of the end points, the corresponding function f1(m) or 
ty(m) is unique and is represented by equation (4.19), If, however, it lies 
inside the interval, there are infinitely many solutions ¢(m) satisfying the 
only condition 


my, 
p(n —T)= —~e f 4) dm. (4.47) 


Mo 


Among these solutions, we can always choose a set of functions (¢(m) 
satisfying equation (4.19) and the appropriate boundary conditions. A 
suitable function f(m) may be sought in the form 


1 


iin) i m+ta, (4.48) 


max 

where the constant a is readily obtained from (4.47). It follows from the 
above that in any case the solution of Problem 2 incorporates the solution of 
the original problem, which is readily derived from the former. We thus 
indeed have the equality sign in (4.35). 


4.2.4, Generalization to the case of free end points 


The above results significantly depend on the construction of the limit 
values of ¢t, which can be done fairly easily for the case of fixed end points. 
The solution of the problem with free end points belongs to the set of 
solutions of problems with fixed end points and therefore has the same 
structure as before. The additional conditions of optimum for mp, to, ™ can 
be obtained by varying the general solution of Problem 2 with fixed end 
points with respect to these parameters. The only exceptions are the 
problems seeking minimum ¢, and maximum m,: the former since ¢ isa 
control element in Problem 2, and not a phase coordinate, and the latter 
since problems with fixed m, are meaningless. The corresponding problems 
can be solved by introducing new variables 


t=at; m=bp, 


where t, p are the new variables (with fixed end points), and a and 6 are 

introduced in (4.33), (4.34) as additional phase coordinates with the 

appropriate supplementary equations a’=0, 6’=0. It is readily seen that 

the structure of the sought solution remains unchanged in this case also. 
Indeed, the equations of Problem 2 in this case take the form 


g =—— Pp, 
»& 
a pat; (4.49) 
a'=0; 
b'=0, 
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where the derivatives are with respect to », and the upper and lower 
limits of t (ty(n) and t,(u)) are constructed using the equation 


t b 1 
v= — 

a B 
with fixed boundary values 1, Mo, ti, Mi. They are described by the 


equations 


tLe) =t)— — = (u— Po); 
max 4.50 
TyHM=T =. eer: 


max 


The functions H and H, for this system are obtained by substituting at 
for ¢ in (4.39) and (4.41). Since the right-hand sides of these equations 
are independent of € and n, the vectors pity and ,) are again constant 
along the optimal solution, so that all the previous conclusions remain in 
force. 

The new system of equations is supplemented with equations for ya) 


and pi), 


0a ot Oa 


where the terms with the derivatives “Vand “iY account for the depen- 
a 


dence of the constraints for t on a and 6. When 1+ takes other than one 
of its limit values, these derivatives vanish. For the limit values of t, 
the equation of the corresponding boundary is differentiated, 

These comments are not to be regarded as practical recommendations 
for the solution of optimum problems. Their aim is to clarify the general 
structure of the solution. In practice, these problems may sometimes be 
solved by considering the conjugate problems, namely the maximization of 
the coordinate for fixed end points fo, mo, mm, hi. 


4.2.5. Singular control 


Sometimes the condition of maximum of H with respect to u does not 
permit identifying a finite number of potential optimum programs. This 
situation is encountered, e.g., when one of the control functions enters the 
Hamiltonian in linear form and its coefficient vanishes in the optimal 
case. The corresponding cases are known as singular control (in 
the sense of the maximum principle). The necessary conditions of the 
maximum principle for the original problem often do not permit reaching 
definite conclusions regarding the optimality of these control functions, 
and this in its turn interferes with the elucidation of the general structure 
of the optimal control. 


Let us consider this aspect in more detail. We will use the equations 
of motion in new variables (4.33), (4.34), (4.19), which are equivalent to 
(4.17) —(4.19). 

The function H for these equations has the form 


=— (odor) = p+} > (4.51) 


where by the conjugate system pc) and ,) are constant and ‘pjq(m) is 
described by the equation 


, oH ¢ 
go> aia, p. (4.52) 


Here wu)(m) is the control switching function. 
Along the singular control, p is given by (4.40) and 


dry = 0. (4.53) 
By (4.52), condition (4.53) leads to 
(Pc) — Pint) Pim =O. (4.54) 
The last equality is satisfied under the following conditions 


beey¥enr 
2 
(4) 


1)  ¢=const= ; (4.55) 


2) Yq =0. (4.56) 
Condition (4.55) corresponds to : =0 (pulsed control). This is a case 


of so-called pulsed singular control. Its optimality can be elucidated by in- 
vestigating Problem 2 analogous to the original problem, with / treated as 
a control element. It is readily seen that (4.55) corresponds to a stationary 
point of the quadratic trinomial in ¢ in the radicand in (4.41). Since the 
coefficient before ¢ is positive, the Hamiltonian has a maximum (and not 
a minimum) with respect to f in Problem 2. The conclusion is that the 
singular control of the type being considered is not optimal. 

We have seen before that the singular control functions satisfying 
condition (4.56) may be optimal and as a rule they are not unique. The 
structure of optimal control in this case has been fully elucidated. 


4.2.6. Generalization to the case of multistage rockets 


The preceding considerations can be generalized to the case of multi- 
stage rockets when the engine of each stage has its own maximum thrust. 
The argument in this case is conveniently chosen as the characteristic 
velocity, which unlike mass is a continuous function of time: 


t 
od errs dat, 
y m (t) 
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Equations (4.33), (4.34) take the form 


g_y,. 4 oy 
oe ag pt, (4.57) 


and equation (4.19) is written as 


i i ah (4.58) 
dus cB (w) 7 , 


where m;(u), B(u), u;(u), c(u) are known piecewise-constant functions of 
u (ui, m;are the characteristic velocity and the mass of the remaining part 
of the rocket when a stage is jettisoned). 

It is readily seen that expression (4.40) for the optimal control function 
p and the radicand in H; do not change in this case. Only the region of 
admissible values of ¢ changes. It is constructed along the same lines as 
before; its form is shown in Figure 4.3. Both the upper fj(u) and the lower 
ty(u) limits of ¢ for u € (uo, ui) are solutions of (4.58) for ¢; ¢,(u) passes 
through the point (fo, uo), and fy(u“) through the point (f1, “4:). Since t(u) 
and 4,(u) are monotonic functions and the interval (4L(z), ty(u)) is fixed, 
the previous results remain applicable. 


FIGURE 4.3 


The above results are generalized to the case when the original equations 
have the form 


, 


f= — 


(w+ fi (4); 


me 
8 
w= p—— fall) (4.59) 
lh 
8 
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where f(t), fe(t) are some piecewise-continuous vector functions such 
that the integrals 


fr@ae § falnae 
to to 


exist. In this case, only the expressions of the first integrals & n in 
terms of r, v, t change. The equations of Problem 2 and the conclusions 
relating to the realization of its solutions and singular controls remain 
unchanged. 

The general variational problem of the dynamics of a point of varying 
mass in vacuum in a homogeneous gravitational field thus has been reduced 
completely to a new Problem 2 for system (4.33), (4.34), with the mass 
appearing as the argument, the coasting integrals of the point mass as the 
phase variables, and the thrust direction p and time fas the control 
elements. The constraints on time are provided by equation (4.19) 
and the boundary conditions for f and m. The right-hand sides of 
(4.33), (4.34) are significantly independent of the phase coordinates; 
as a result, the adjoint factors are constant. The investigation of 
Problem 2 gave the following results. 

1. Optimal control in general may comprise two powered sections of 
maximum thrust separated by a coasting stage. In the limit, with 
Bmax oo, this corresponds to a case of two-pulse motion, when the first 
power pulse is applied at the initial instant and the second pulse at the 
final instant. 

2, With special boundary conditions, when the final value of the 
vector 1 is either free or is fixed so that the vector increments 1:—v7o and 
—§:1\—£) are colinear, the optimal solution is not unique. In terms of the 
maximum principle, these solutions constitute the singular control for the 
original problem. 

3. Pulsed singular control is non-optimal and is not included in the 
structure of optimal solutions. 

4. The structure of optimal solutions described above is fundamentally 
independent of the number of stages, even if the maximum thrust potential 
of each stage is different. 

5. The equations of optimal control can be readily integrated analytically. 
Any particular optimization problem for motion in a homogeneous field 
thus may be solved as a problem of minimum of a function of a finite number 
of variables, although this is not a general recommendation. In a number 
of cases, proceeding from the specific features of particular problems, the 
expressions of the control functions may be markedly simplified before 
integration, and the final result is thus obtained by a shorter path. 


§ 4.3. VERTICAL ASCENT OF A ROCKET IN THE 
ATMOSPHERE TO MAXIMUM ALTITUDE 


The present section is devoted to the solution of a problem presented 
in Introduction (ExamplesI.1 and I.2). This problem of rocket dynamics 
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has been treated by a number of authors /4,10,13/. On the one hand, it 
is of immediate practical interest for programming the thrust of sounding 
rockets, and on the other hand, it may be regarded as a model of more 
complex, two-dimensional and three-dimensional problems of rocket 
dynamics. Proceeding from the considerations in the Introduction, we see 
that solutions satisfying Euler's equation are not necessarily optimal. The 
analysis that follows not only provides an illustration of the technique but 
actually solves the problem in full form and proves the optimality of the 
solutions. 

Let us briefly reiterate the problem. Find the motion reaching the 
maximum altitude among all the admissible motions satisfying the conditions 


h=V; V=—— (X(t, V)— P)— g(t) 
Rae Bie: (4.60) 
A(0)= fo; V(O)=Vo; m0) =m; 0< P<oo, 


where A, V, m are the altitude, the velocity, and the mass of the rocket; 
X(h, V) is the drag; Pis the thrust; g(h)is the gravitational acceleration; 
c is the nozzle velocity. 

As is readily seen, the equations of motion (4.60) contain the controlled 
thrust P in linear form, so that the method of multiple maxima can be 


applied. 
The functions R and @ for this problem take the form 


Raa +4 | — 2X (h, VE (i) +Plm|—eqPle+ a4 (4.61) 
O(4y, Vy, AHA teh, MM, A1,Vi1)— (lo, 1%, Mo, Vo). (4.62) 


The equation of multiple maxima in this case reduces to 


$v — m= 0 (4.63) 
m 
and its general solution is 
o= 7 (4, A, nV, m)), (4.64) 
where 
nV, m)=V—cln7>. (4.65) 


The physical meaning of the first integral yn is the difference between 
the actual velocity and the characteristic velocity of the rocket. 
The equations of Problem 2 have the form 
h=V: = —— X(t, V)—g(h). (4,66) 


Seeing that the time of motion is not fixed and y is a monotonic function 
of time (1 is strictly negative by (4.66)), we may change over to a new 
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variable, dropping the equation for ¢. The problem thus reduces to in- 
vestigating the one equation 


an ~, 4 = f(h, 1, m), (4.67) 
4 Tn eV) + 8A) 


where 
V=ntein™, 
m 
Now, applying the maximum principle, say, we derive conditions which 


exhaust the solution of the following Mayer problem: for every fixed 7 from 
the interval (11, No) 


f(a), ym(ny= int f(A (n), 0, m) (4.68) 
m,sme<m, 
and at the final instant 
f (Ay. m,)=0. (4.69) 
By (4.69), Vi=0, i.e., 
Ue In mo 2 
my 


The limits m,(n) and y(n) are constructed using the equation 


MO oe 2 PIE os (4.70) 
dn , 3 


1 
Ate) 


which is obtained by changing over to the new variable yn in the original 
system (4.60) and in the boundary conditions. 


FIGURE 4.4 
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The constructions in this case are entirely analogous to those considered 
in § 4.1. The upper and the lower limits my(n), m_(q) are the solutions of 
equation (4.70) for P=0 which pass through the points (Mo, qo), (m1. M1), 
respectively. They are shown in Figure 4.4. Evidently, mu(no) =o, 
m,(y1) =m. The solution of equation (4.68) for the general case is shown 
in Figure 4.4. Depending on particular boundary conditions and rocket 
parameters, all the three cases shown in Figure 4.4, a,b,c, can be 
attained. Case a corresponds to a combusion pulse in which all the fuel 
is burnt, followed by coasting to 41(Vi=0). Case b is a combination of an 
initial pulse with the fuel burning to some (no) then a stage of 
throttled thrust #%(n) which satisfies the condition 


of 9 (4.71) 


om 


until all fuel has burnt out and coasting to V=0. Finally, case c 

differs from the previous case in one respect only: the boosting pulse is 
replaced by deceleration down to a velocity V, corresponding to the 
condition 


m (N)= mmo. (4.72) 


If we are to remain strictly in the set of admissible solutions, the pulsed 
solutions should be approximated with sequences of control functions with 
the thrust increasing to infinity (curves 1, 2, 3,... in Figure 4.4), 

Condition (4.71) corresponds to the dependence derived in Examples I. 1 
and I. 2 by differentiating the equation of the control switching line M=0. 
This dependence, in its turn, is equivalent to the relations obtained for 
throttled thrust programs in /2, 4,10, 13/. 

It is readily seen that the solution of Problem 2 constructed in this way 
satisfies the original equations and therefore solves the original problem. 
This solution indicates that the singular control is optimal in the present 
case, 


§ 4.4. THE DEGENERATE PROBLEM OF THE OPTIMAL 
POWERED ASCENT TRAJECTORY OF AN AIRCRAFT 


Consider the motion of the center of mass of an aircraft in the vertical 
plane under the combined action of the gravitational forces, air drag, and 
engine thrust. The equations of motion are written in the form /7/ 


mV =P(h, V)—X (A, V, a)—mg sind; (4.73) 
mV6=P(h, V)a+Y(h, V, a)—mgcos6, (4.74) 


where h is the altitude, V is the velocity of the aircraft center of mass, 
mis the mass, 6 is the angle of inclination of the trajectory to the horizon, 
ais the angle of attack, i.e., the angle between the wing chord and the 
direction of the velocity vector, gis the gravitational acceleration, which 
is assumed constant in magnitude and direction, P(h, V) is the engine thrust 
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specified as a function of altitude and velocity, X and Y are the drag and 
the lift on the aircraft, respectively: 


X=c,(M, a) 2a S: 
(4. 75) 


Y =c,(M, a) 2a S. 


Here @(h) is the density of the atmosphere, S is the effective wing area, 
M is the Mach number, defined as the ratio of the velocity V to the speed 
of sound a(A) at a given altitude, cx, and cy, are the drag coefficients; the 
dependence of the drag coefficients on the angle of attack for flight in the 
dense layers of the atmosphere is expressed by the relations 


c,(M, @)= ¢,9(M)-+ 0(M)e?2, (4.76) 
cy(M, a)=cy (M)a. (4.77) 


Equations (4.73), (4.74) are written assuming sufficiently small angles 
of attack, and we may therefore take cosa=1 and sina~a. These equations 
should be supplemented by two further relations 


h=V sin; (4.78) 


m= —B(h, V), (4.79) 


where £ is the per-second fuel consumption (a known function of fA and V ), 
We will only consider ascent trajectories with 


O<a<n, (4.80) 


Horizontal flight with @=0 is allowed in the limit. 

The importance of this conditions for flight near the Earth is quite 
evident. For many types of aircraft, it is highly desirable over the entire 
section of powered ascent. Thus, using (4.78) and remembering that 
because of the inequality 4>0 we may eliminate the time ¢ from the 
equations of motion, we change over to the argument A and write 


— = ——_ - >; (4.81) 

dh mV sin§ V 

gb Pa+Y —gctge. (4.82) 

dh mV2sin® v2 ° . 
AO (4.83) 
dh V sind 


If the aircraft motion is described by these equations, the three numbers 
V, 0, A may be regarded as a phase vector, and the attack angle a assumes 
the role of a control element. 


The set V. of the admissible values of the control element a is defined 
by the inequality 


lal<v, (4.84) 
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where the number y>0 is chosen from considerations of aircraft stability 


and mechanical strength. 
Suppose that at the initial time f=0 the aircraft has known altitude, 


known velocity, and known mass, 


h(0) =H; V(0) =Vo; m(0) =m. (4.85) 


When at a given altitude h\> fo, the aircraft is required to have a given 
velocity Vi>Vo, i.e., 


Viti =V). (4.86) 


These boundary conditions define two sets V,(k=/ ) and V,(h,) of the 
admissible states of the aircraft at altitudes h=hy and h=h,. Let us 
determine the set V,(A) over the interval (Mo, 4). We assume that the mass 
of fuel consumed by the entire maneuver is sufficiently small, so that we 


may take 
m= mo= const. (4.87) 


The angle 6 according to the above is restrained to the limits 0<6<a. 
The velocity V should satisfy the inequalities 


Vi (h)<VSV2(h), (4.88) 


where the lower limit V;(A) is defined by the formula 


V,(A) for Vi(4)>0 \. (4.89) 


Vi(A)= 
0 for F(A<0 


V, (Ah) is the solution of equation (4.81) for a=+y, 6 = = with the initial 
condition V(4j)=Vo. The upper limit is expressed by the equality 


V2(h)=min [Vo (A), Va" (A)]. (4.90) 


FIGURE 4.5 
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Here 2(h) is the solution of equation (4.81) for a=+y, 0= 4 which passes 


through the point A), V; of the (4, V) plane (Figure 4,5), and V2*(f) is 
obtained from engineering considerations. Thus, considerations of 
mechanical strength demand that the dynamic head does not exceed some 
fixed value q: 

Q (a) V2 


7) <4. (4.91) 


The curves V=9,(h) and V=V2(h) are the projections of the "maximum 
braking" trajectory onto the (f, V) plane. This 'maximum braking" is 


attained for vertical ascent o=5] and maximum drag (a=+y). 


Physically, the condition (hk) <V<W2(h) means that an aircraft moving 
at an altitude A with velocity V(hk)>V2(h), say, will reach the altitude h, 
with velocity V(4,)>V,, thus violating the boundary condition V(éi)=V,. 

An element z of the set D of admissible trajectories is thus a com- 
bination of a piecewise-differentiable vector function y(h)=(V(hA), 6(h)) 
and a piecewise-continuous function a(h) defined on the segment [ho, Ay], 
where they satisfy equations (4.81), (4.82) and conditions 


y(AE Vy (A); | 
a(h)E Vo. 


(4.92) 


The last conditions are equivalent to the boundary conditions (4.85), 
(4.86) and inequalities (4.80), (4.84), (4.88). 

Among all the elements z€D, find the one which minimizes the 
functional 


Ay 
Bris aay ee 4.93 
/ (z)=mg—m(hy) J ait dh. (4. 3) 


In other words, we are looking for an admissible flight trajectory such 
that the ascent from altitude hg to fh; and acceleration from velocity V» to 
V, is effected with minimum expenditure of fuel. 

Before proceeding with a solution of the problem, we have to make one 
further assumption concerning the properties of the right-hand sides of 
equations (4.81), (4.82), which will further restrict the range of application 
of the problem but significantly simplify its solution. We will assume that 
within the limits specified by (4.84), the drag coefficient may be treated 
as independent of the angle of attack: 


c,(M, a) = c&(M)+6(M)a2=c8(M), (4.94) 


This assumption makes the problem degenerate (the control element a 
is now a linear component on the right in the relevant equations), and it 
can thus be solved by the special methods described in Chapter III. 

Thus following the general outline for the solution of variational 
problems from the sufficient optimum conditions, we introduce a function 


109 


(A, V, 0) which depends on the argument / and the phase coordinates V 
and @. We further construct the functions 


R(h, V, 8, =e me 4] sts 


mV sin@ V 
Peters! = ep Pe 4.95 
+4 a [Pat ¥ (ht, Vy @)— mg cos 8] — T+ 3 (88) 
D(Vo, 9. Vi, 9:)=9(Ay, Vi, 9)—9 (Mo, Vo, 9%). (4.96) 


We are looking for the vector functions (V(A), 6 (A), a(h))€ D anda 
function (A, V, 6) such that 


Rik, V (A), 8(4), a(A)J=p(4)= sup RA, V, 4, a); (4.97) 
a| 
Viner ae 
0<0< 
(Vo, O, Vy, :)= inf ® (Vo, Vy, %, 0). (4.98) 
eee 


In virtue of (4.75), (4,77), and (4.94), Ris linear in a. We may thus use 
the method of multiple maxima in order to find (hf, V, 6). If R is to be in- 
dependent of a, we should have 


pa(hk, V, 8)=0. (4.99) 
The general solution of this equation is 


g=¢,(h, V), (4.100) 


where 9,(A, V) is an arbitrary function of h# and V. 
Inserting p=q,(h, V) in (4.95) and (4.96), we find 


fae P~—X 1 ee ee : 
R(h, V, 9) =| ev Wi lan Piv V + Pai (4.101) 
D(Vo, %, Vi, 8)=91 (1, Vi) —-9 (Mo, Vo). (4.102) 


Condition (4.98) is now satisfied automatically for any 9,(A,; V), since 
®@ is independent of @ and 6,, whereas Vo and V, are fixed. 

The function R is now indeed independent of a, i.e., any a@ maximizes 
this function for fixed h, V, and 8. We see from (4.101) that by imposing 
an additional condition on p=, (h, V), 


ig SG, (4,103) 
m 
we make R independent of 0 either. Condition (4.103) imposed on q,(h, V) 


corresponds to a repeated application of the method of multiple maxima to 
our problem, since 9 in Problem 2 is a control element. 
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From (4.103) we have 


= mB (A, V) 5 10 
Piy= P(h, V)—X(h.V) (4.104) 
Vv 
4(h, y=(ey av; (4,105) 
Vo 
Vv 
Pin (Ay v)=m{ 2 [axl dV. (4.106) 
Vo 


The integral in (4.105) and (4.106) is evaluated for fixed A from an 
arbitrary lower limit Vo. Inserting (4.104), (4.105), (4.106) in (4.101), we 
find 


Vv 
R(h, V)= — les rraa ss j = (=e dV. (4.107) 


The right-hand side of this equality is a know function of A and V. 
Condition (4.97) for a maximum of R with the given @ defines the optimal 
dependence V(A): 

R(h, V(A))= sup RA, V). (4,108) 
VA) <V KV, (h) 

Here R(h, V) is given by (4.107). 

It follows from (4,108) that the optimal flight program V(h) for every 
fixed h either coincides with one of the limit values V,(A) or V2(A), or is 
a solution of the equation 


aa mgp a ee 
Ry (h, VJ — = se ralager peor 0, (4.109) 


or, finally, corresponds to a vertical jump (in the (4, V) plane) from one 
of these V(h—0) to another V(h+0) corresponding to horizontal flight. The 
order in which these pieces are ''matched"' is determined by equation 
(4.108), which fully describes the optimal program. 


FIGURE 4.6 
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Let us now consider the engineering computations of the optimal 
powered ascent trajectory proceeding from (4.108). 
1, Numerical integration gives the solutions V;(h) and V2(k) of equation 


(4.81) for the vertical ascent (= +). which pass through the points (Mo, Vo), 


and (fh, Vi), respectively. These solutions are plotted in the (h, V) plane. 
The curve of constant dynamic head q is plotted in the same plane, 


(A) V2 


2 q; 


and using (4.88), (4.89) we construct the limits V,(f) and V2(H) of the ad- 
missible velocities. 

2. For every fixed A € [Ao, Ai], we use (4.107) to construct the function 
R(h, V) on the segment [V,(4), V2(k)]. The integral in (4.107) is obtained 
by numerical or graphical integration. Figure 4.6 shows a specimen 
function of this kind. 

3. For every h, we choose a velocity V(h) corresponding to the maximum 
value of R(A, V) over [V\(A), V2(h)]. As a rule (see Figure 4.6), the function 
R(h, V) has two local maxima satisfying equation (4.109) (the subsonic and 
the supersonic extremals, respectively); for h<h* (low altitudes), 

Rfh, V'(hA)]>R[h, V"(A)], andfor A>hA* (highaltitudes), R[A, V'(A)]<R[A, V!(A)]. 
There may, however, be more than two maxima. 


N70 h h ‘7 h 


FIGURE 4.7 


The optimal dependence V(h) corresponding to the function R(h, V) ob- 
tained in this case is shown in Figure 4.7 for the initial conditions f4y=0, 
Vo=0. For h<h*, : 

V(h)=Vi(A), 
and for A>h’*, 
V(h) =min[V! (A), Vo(A)]. 


This V(h) corresponds to the following flight conditions. The aircraft 
first accelerates along a horizontal trajectory from V)»=0 to some velocity 
V!(0) at the same altitude A=0 (in practice, obviously, the flight altitude 
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is somewhat higher). The aircraft then climbs at the subsonic extremal 
velocity V=V'(h) to the altitude A=A*, where V!(h*)= V!'(A*). At this 
altitude, the aircraft again assumes a horizontal trajectory, accelerating 
from V!(A*)to Vil(h*), covering the entire sonic range of velocities. Then 
the aircraft climbs along the supersonic extremal V'!(#) until the limit of 
the dynamic head constraint is reached. After that it moves with constant 
dynamic head g, again along the supersonic extremal, and finally 
accelerating in the horizontal direction at altitude fA; to velocity V;. 

A different optimal trajectory V(h)is shown in Figure 4.8. Sections of 
this trajectory may correspond, in the limit, to vertical ascent of the air- 
craft. 


FIGURE 4.8 


Inserting the optimal dependence V(h)in (4.81), we find the corres- 
ponding optimal dependence 6(A). 

The kinks in V(#) correspond to discontinuities in 8(h), Strictly speaking, 
this solution, although exact in the sense of Problem 2, does not exist in the 
original class D, since the functions 0(A) in this class are continuous. 

A solution which belongs to D can be obtained if the above solution of 
Problem 2 is replaced in the neighborhood of the discontinuity points by 
solutions of the original system of differential equations with a=+y for the 
discontinuities where 0 increases and with a=—y for the discontinuities 
where 6 decreases. 

The solution formed in this way should be treated as an approximate 
solution in D, with the estimate |/,—/*|, where /) is the minimum value of 
the functional in Problem 2, /*is the value of the functional for the new 
solution. Since /2</, where / is the exact minimum of the functional on D, 
we have 


[f2—/*| > [I—I*]. 


If the left-hand side of this inequality is sufficiently small, the right-hand 
side is also small, i.e., the approximate solution is sufficiently exact and no 
better approximation need be sought. If the estimate |/.—/*| is large and 
does not satisfy the required accuracy, the above method cannot be applied in 
undiluted form to the solution of the particular problem, and it only providesa 
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qualitative description of the sought solution. In these cases, the result may 
be used, say, as the first approximation in some successive approximations 
scheme whereby the solution is brought to an end. 

Remark 1, A similar procedure can be applied to find a trajectory 
V(h) minimizing the maneuver time 


hy 
T=\ ae (4.110) 
; V sin6 . 


To this end, f(A, V) should be replaced with unity in expression (4.109) 
for the function R(h, V). 

Remark 2. The degenerate problem of the optimum powered ascent of 
an aircraft considered in this section played an important role in the develop- 
ment of optimum flight theory. It was first tackled by Ostoslavskii and 
Lebedev /6/ in 1946. They obtained the subsonic extremal V'(A), known as 
Ostoslavskii's curve. Miele /12/, by an ingenious application of Green's 
theorem, proved the optimality of trajectories containing one of the extremal 
branches and described the fundamental structure of the optimal trajectory 
in the case of multiple extremals. The question of the optimal position of the 
"jumps" (the horizontal flight sections) between the different extremals re- 
mained open, however. A complete solution of the problem by the method 
described in this section was finally published in /5/. 


§ 4.5. APPROXIMATE SYNTHESIS OF OPTIMAL 
ANGLE-OF-ATTACK CONTROL FORA 
LIQUID- PROPELLANT WINGED AIRCRAFT 


4.5.1. Statement of the problem. Equations of motion 


The object of our analysis in this section is a liquid-propellant winged 
aircraft normally flying in the dense layers of the atmosphere, at altitudes 
up to 20km. The motion of the aircraft is investigated in the wind system of 
coordinates under the following basic assumptions. 

1. The aircraft motion is considered in a vertical plane in the atmo- 
sphere of a non-rotating Earth with a homogeneous gravitational field. 

2. The aircraft is treated as a point of varying mass, since liquid- 
propellant engines are characterized by high rates of fuel consumption. 

3. Since the engine thrust is assumed to remain constant, the aircraft 
motion is entirely determined by controlling the angle of attack of the 
wings. 

4, The per-second consumption of the liquid propellant under conditions 
of constant engine thrust is independent of flight velocity and flight altitude 
/6/, i.e., the aircraft mass is a known function of time. 

5. The engine thrust is constant, and the direction of the thrust vector is 
along the longitudinal axis of the aircraft. 
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6. The polar curve of the aircraft is a family of quadratic parabolas 
of the form 


€,=Cy9(M)-+¢ (M)o?; 
iis (4.111) 
Cy= A(M)a, 
where c, is the drag coefficient, Cxo is the component of the drag coefficient 
for zero angle of attack, cis a coefficient which allows for the lift-drag 
coupling, ais the angle of attack, M is the Mach number. 

The functions Cs=[(M); c=f(M); A=f(M) are represented as polynomials 
in M, 

7. The aerodynamic forces on the aircraft are determined assuming 
stationary conditions. 

8. The limits of variation of the angle of attack are sufficiently small and 


we may therefore take 
a2 


sina =a; cosa 1——-. 


The equations of motion of an aircraft in the wind system of coordinates 
under the above assumptions take the form 


v= [P—X,(V, 4) —(0.5P-+.X,(V, h))a2— mg sin 6}; (4.112) 
i= {[P-+Y (V,h)] a—mg cos 6}; (4.113) 
h=V sino, (4.114) 


where V is the flight velocity in km/sec, his the flight altitude inkm, 6 is 
the angle of inclination of the flight trajectory to the horizon in rad, Pis the 
engine thrust, Xo(V, 4) is the drag for zero angle of attack, X;(V, 4)a? and 
Y(A, V)a are, respectively, the induced drag and the lift. 

The atmospheric density and the speed of sound as a function of altitude 
are described by the following relations: 


20—h 


; a(A4)=0.340 V 1—0.02255h . 


h 
e=0.2105e 4. a(n) 0.295 (for h>11km). 


In what follows a is considered as a control element, with the following 
inequality constraints: 


a;<a<ap. (4.115) 


__We will construct an optimal synthesis of the angle of attack a(é, y) = 
=a(t, h, V, 8) which moves any point of the space (¢, A, V, 8) intoa point fi, Ait, 81, 
Vi with the maximum velocity V;. 

Here f;;=100sec, Ayp=19,000m, O;=0.174 rad. 

This is clearly the dual problem of the problem of minimum time (or 
minimum fuel consumption, since the dependence m(t) for reaching the point 
th, Ait, O1f, Vig is known). 
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Instead of a problem with a fixed right end point, we will consider a 
problem with a free right end point for the functional 


3 
1=Vit DA (yl— yh), (4.116) 
bard 


where Ag,3 are some positive constants. 


4.5.2. Changing over to new coordinates. Description of V,(¢) 


The set V,(¢) is defined as a parallelepiped in the y space, described by 
the following inequalities: 


ViQ<Vv 
6—e<e<6-+e; (4.116a) 
h—x<h<ht+x, 
where 
V =0.000066#2 +. 0.0027; (4.117) 
@= 0.0002163¢2 — 0.038754 +- 1.92; (4,118) 
h=0.0013f + 0.048241; (4.119) 


y=0.2 km/sec; e=0.1 rad; x=1km. 


This definition of V,(t) is dictated by tentative estimates of that region of 
the (4, y) space where the optimal trajectories corresponding to the sought 
synthesis lie (we do not intend to consider this problem in any detail here). 

Given this region V,(t), we can change over to new coordinates (ft, z) using 
the transformations 


y=yl (t) +21, (4.120) 
where j! (t) are defined by (4.117) (4.119). 
The equations of motion thus take the form 


1 
m(t) 


—([05P+X,(V+2!, 4+2%)] a?—mg sin(6-+z2)} —0.000132¢— 0.0027; (4.121) 


zl f'(t, z,a)= {[P—X,(V +21, h+23)— 


z= f? (t, z, 2) ag Par te, h+z')}a— 


— mg cos (8+ 22)] —0.0004326¢ + 0.03875; (4.122) 


B= fr(t, z, a)—=(V +z) sin (6-+-27) —0,0026¢— 0.04824. (4.123) 
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4.5.3. Solution of the problem 

By applying the method of Chapter III, we reduce the solution of the 
problem to finding a function 9(¢, z) which satisfies the following conditions 
of an absolute maximum of the functional:* 


P(t,z)= a Ais (1,2,0)—= an [ef (42.0) eO)]= c(t); (4.124) 
@ (t,,2)= 2! 4D), (z{-+2/,)? + (4,21) = const, (4.125) 


in other words, we have to solve the Cauchy problem for a first-order non- 
linear partial differential equation. 

No exact solution of this problem can be obtained in general. We will 
therefore use the approximate method of optimal synthesis described in 
§ 2.3. The function gis sought in the form of a polynomial 


2 2 2 
e(tz)= > (24 & (22)! (x ditt )) (4.126) 
i,=0 


= 1,=0 


with unknown coefficients z,1,7,(#). There is a total of 27 coefficients here. 
We further demand that conditions (4.124), (4.125) be satisfied for all ¢ at 
27 specially constructed reference points in V,(¢) (and definitely not every- 
where inthis region), which are obtained by combining the following values 
of the variables z!: 


2% —e, 0, +e; (4.127) 


zi; 0, yy 2y; 
z: —x, 0, +%. 
In this case the reference curves — the loci of reference points — are 
parallel straight lines ze =const, B=1!, 2,..., 27, in the (f, z) space. 
The function R may be represented in the following form, which is par- 
ticularly convenient for minimization with respect to a: 


R= Ae?+ Ba+C+9,, (4.128) 

where 
Aso eu [os E+ Kxt V2 +29 (4.129) 
B= lesser (Stree V +21, ite]; (4,130) 


io) 


=9e [Xo +2484 2)—gsin@+2)— 


— 0.000132 — 0.0027 | — ex [esctiees 
Pe V + 2 + 


+0.000432/— 0.03875] + e2[(V +21) sin(@-+ 22) — 0.00264 — 0.04824]; (4.131) 
7,1and @ are partial derivatives of the polynomial (4,126) which, after ex- 
pansion of the parentheses and transition to a continuous numeration of the 


unknowns, take the form 


* In maximizing the functional, the condition of maximum of R in Theorem 1,1 is replaced by a condition 
of minimum, 
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P= Pr Fez" Yaz? + Hz? + 5 (21 +96 (27P + 7 (29P + 
TF Y—2tz? + Yyztz8 + hyp2729 fF hy12! (27) + hyo2? (27h + 
1452? (2° + dra(24P2? + tas (2172? + das (2729 + 
“P47 (21P (2P + hag (21? (2° + Hao (272 (25? + a2 2723 
+ ¥n2!22(23P+ de (22P 2+ tz!) 2228 
$F Yng2* (27)? (29)? + os (Z1)2? (25)? + Pog (24)? (22) 29 + or (2"P(2?P(Z*P, (4.132) 


Minimization of the quadratic trinomial (4.128) in a under the above 
constraints on a gives the following results, which depend on the value of 
the trinomial coefficients: 


P(t, z)=inf R(t, 250)=H(, 2) + 9(4), (4.133) 
where for A> 0, 
# =Azr+Ba,+C G=a, -2 <a,; 
pp Be gee pte wii 
7 +C a a an< oD <a; (4.134) 
H#=A2+Ba,C a=a, — A> a; 
for A<0, 
4 =Aai+Ba,+C a=a,; (4.135) 


H# =Aozi+Ba,+C a=oy, 


and the coefficients A, B, C are obtained from (4.129) —(4.131). 
Since (4.124) must be satisfied on the reference lines 23, we obtain 


the relations 


1p = — Hp (P—1,2,. . ., 27), (4.136) 


which constitute a system of first-order nonlinear ordinary differential 
equations. These equations are not solved for the derivatives tbe, but these 
derivatives enter the equations in linear form. Using matrix notation, we 
may write (4.136) in the form 


Abp= — He, (4,137) 


where A is the matrix of the system coefficients, His the column vector of 

the free terms, %,is the column vector whose elements are the unknowns. 
The solution of the above system of equations with the reference lines 

Zp =constis obtained without considerable difficulty, since the elements of the 

direct coefficient matrix Aare constant at any time ¢€ (fo, 41). To obtain the 

solution, we first solve the system for », 


p= AY, (4.138) 


and since the elements of the direct coefficient matrix A are constant at any 
time, the inverse matrix A~! should be determined only at the very beginning 
when solving (4.137), 
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The initial conditions for ps (Bp=1, 2, ..., 27) at time ‘=¢, are obtained 
from (4.125): 


® (4, zl)=2}2,(2—z8P $A, (B— 27" + P(A, 2}, 21, 2)=C, (4.139) 


or 


® (t,, 21, 22, 28) 21d, [(23P + Qzt2? + (227) + (4 yP— 
— 2228+ (Pt beet + tezt + et + 
+5 (2)P + 95 (22? + by (28? + Yaz }z3 + 
+ oztedt du0z%2} + dz} (22? + dezk (P+ 
H 1923 (23)? + daa (ZH)? 23+ tas (2)? 28+ 
+ dig (2q23 + Par (21? (27)? + aa (24? (AP + 
tig (Zz (29° + Hoo 212724 + dorziz3 (28)? + 
Paz y(2°24 4 tao 212324 + Feaz1(2PZP 


“das 2))°24 (21) + taol2])2 (2D 2} +e (PPP =O. (4.140) 


Condition (4.125) in the above case may be exactly satisfied if we request 
that 


oy =1 Ft do 252 -b bez + bz} + hi(27? + bo23? + 
ADU, 2122-4 Dyszlzt-t Wyp2t(222-+ Mg 21 (27+ dagetz + 
bon 25 (27)? doo (ZF) 23 + Boge 1ZIZF + doy (277(23 + 


> ag 21 27 (z3)? + Qhigzt (27°23 + Why, Zz} (22)? (z3P2 = 0; (4.141) 


ae == DAZ? -f by — DyZ? + DWygz?-+ yz} + bro 29 + 212425 + 
H+ $13 (24? + Pra2q? + Qhrg2Zt + 2 2321? + Ahigzy(Zq? + 
+inelel tone! (el! + Mineletet + ta (eD) 2h + 
Ht bog2 125 (23) + 05 (2]) (24 + Dag (272724 + 
Ft 2bo7 (21)? 27 (27)? = 0; (4.142) 


os = 2oz3—Dgz8 $oy+ Qy,z3 oz} + Pozi} + 
fF 2; 22] ZF + 2132723 + 5 (24)? + Vig (27)? + Hig (Zf 23 + 
4 2o(eDPet + delet + Bz zz4 + dre} (29)? + 


tag (21)°25 + Whog2z} (24 P23 + Qos (2])2z724 + bog (27)*(24)? + 
$297 (24)? (27)? 28 0. (4.143) 
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It follows from (4.141) —(4.143) that at time ‘=f, the functions ‘ps (¢,) should 
take on the values 


Wy Ay (222 —2,(23, tye=1, b= 222, 


p= 223, be= — Dy, Hye Do (4,144) 


p ’ 


bot =SUg= . . = ty =0. 


Here A, and Az are some positive numbers, which are sufficiently large in 
our sense. 

The solution of system (4.136) with the above boundary conditions gives 
the unknown coefficients w3(f) of the polynomial (4.126) and thus determines 
the sought approximate expression of the function @, its derivatives, and 
finally (from (4.134) and (4.135)) the expression for a(t, 2). 

Simultaneously with the integration of (4.136), we can integrate the 
equation 


A=sup R(t, z)—inf P(t, 2) (4.145) 
Vy Vy 


with the initial condition A(t,)=0. This gives an estimate A of the 
approximate synthesis at the initial time. 


4.5.4. A scheme for a computer 


The problem was solved on the BESM-2 computer. The special program 
used in this case consists of two parts: the direct program, which solves the 
system of ordinary differential equations (4.136) and determines the error 
in the direction from ¢t,= 100 sec to fy= 30 sec, and the inverse program, 
which is used to construct the optimal trajectory for a certain characteristic 
combination of initial conditions. 

The error of the method, the optimal trajectory constructed, and the 
corresponding optimal program for the angle of attack a(t) are shown in 
Figures 4.9 through 4.13. The following initial conditions were use: f,=30 sec, 
V(to) =0.139km/sec, O(é) =1.115 rad, A=2.7km(z!'=z2=z3=0). 
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Chapter V 


OPTIMAL PROBLEMS IN THE DYNAMICS 
OF COASTING IN THE ATMOSPHERE 


The present chapter deals with the general variational problem of the 
dynamics of two-dimensional (plane) motion in the atmosphere of a winged 
aircraft with controllable aerodynamic lift; constraints on the coordinates 
and the control element are taken into consideration. We will analyze a 
typical case when the sought optimal control is degenerate (the case of 
sliding control). It is shown that if the limit values of the control function 
are equal in magnitude, the analysis of the sliding control reduces to 
minimizing an integral functional without differential constraints. The 
results of this analysis are then applied to solve the problem of the optimal 
descent of an aircraft in the atmosphere with minimum added integral heat 
in the critical zone of the aircraft. The method of multiple maxima de- 
scribed in Chapter III is used. 


§5.1. OPTIMAL TWO-DIMENSIONAL COASTING 
OF WINGED AIRCRAFT IN THE ATMOS- 
SPHERE IGNORING RANGE 


5.1.1. Statement of the problem 


We will consider the motion of a winged aircraft, with its engines 
cut off, in the atmosphere of a spherical non-rotating Earth under the 
following conditions: 

1) equations of motion of the center of mass of the aircraft: 


h=V sinO; (5.1) 
V=—X(h, V, cy)—g sind; (5.2) 
b= IV (h, V, oo) +(— 5 cose]; (5.3) 
2) boundary conditions 
4(0), V(0), 8(0)E B(0); (A(4), VA), 8(A)) € BAD; (5.4) 
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3) constraints, for ¢€ (0, %), 
VY, VEBO, cyE ley, (4,4,V), Cy, (4, 4, V)). (5.5) 


Here fh, V, 8 are respectively the altitude, the velocity, and the in- 
clination of the aircraft trajectory (the phase coordinates); cy is the lift 
coefficient of the aircraft (a control element); Y(h, V, cy), X(h, V, cy) are 
the lift and the drag per unit mass of the aircraft; gis the gravitational 
acceleration; r, is the Earth's radius. 

Equations (5.1)—(5.3) are considered ignoring the horizontal flight range. 
The functions X(h, V, cy), Y(#, V, cy) are assumed to be described by the 
ordinary equations of stationary aerodynamics: 


X (A, V, cy) = SE ce (ex,(h, V)+ b(A, V, Cy); 


G 2 
Sg ev? 
V =A fe 
Y (A, ’ a) G 9 yy 


where o is the density of the atmosphere, Sis the characteristic area, G is 
the aircraft weight; for every fixed A and V, 6(h, V, cy) is a symmetrical 
concave function of cy. The variation of the gravitational acceleration g 
with altitude is ignored. The last assumption, like the neglect of the Earth's 
nonsphericity and its spin, introduce very slight errors in the computation 
results, and yet greatly simplify the mathematics. These effects can always 
be restored in full without any modification of the basic mathematical 
apparatus. The change in aircraft mass associated with possible heat sub- 
traction is also ignored. The functions A(t), V(t),6(t)are continuous and 
piecewise-differentiable, c,(t) is a bounded piecewise-continuous function. 
These assumptions are commonly used in variational problems of flight 
dynamics. 

Let D be the set of the admissible motion programs A(t), V(t), @(¢), cy(Z) 
satisfying the various conditions and assumptions. 

Our problem can be stated as follows. Find a sequence of motion 
programs 


{a,(), Vs), 5(4), ey(#)} ED (5.6) 
on which the functional 


f 
T= {F(t V)dt+ F (lo, Voy 8 ry Vip 64) (5.7) 
0 


goes to its least value over the set D. 

The form of the functional (5.7), the constraints (5.5), and the boundary 
conditions (5.4) depend on the particular problem being considered. They 
will be improved and adjusted when necessary. Without loss of generality, 
we may take Cyi<Cyz. 
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5.1.2. Analysis of the optimum conditions 


We will now investigate the function R for the given system comprising 
the functional and the constraints, 


R=V sin® —gy (X (h, V, cy) +g sin 8) + 
tea [PVs e+ (Fe) cos] — Uh, V)+er (5.8) 


If the function g(t, 4, V, @)is given, the function R for some fixed ¢, h, V, 8 
will depend on ¢, only, and its character, in virtue of the properties of the 
functions X(A, V, cy) and Y(h, V, cy), will be determined primarily by the 
magnitude and the signs of the coefficients of py and gs. Suppose that the 
following conditions are observed in some region of the (¢, A, V, 6) space: 


Py <9; (5.9) 


Pv [x (4, V, Cy,)—X (A, V, ey, )]—%e ald (4, V, ey,)—Y (A, V, Cy,)|=9. (5.10) 


As is readily seen, R attains a maximum for two values of cy, namely 
ey(th, V) and cy(t,h,V). If the sought optimal program belongs to the 
relevant region and satisfies conditions (5.9), (5.10), it constitutes 
degenerate (sliding) control with degeneracy & of at least 1. If conditions 
(5.9), (5.10) are not satisfied, the maximum of & is attained only on one 
Cy, either inside the segment [Cy1, cy2] or at one of the end points Cyt, Cy2. 
The corresponding optimum control is of Euler type in the former case 
and of boundary type inthe latter. Weonly investigate the conditions 
corresponding to degenerate programs since, first, as we shall see from 
what follows, they are quite typical of the problems being considered, and, 
second, Euler and boundary optimal controls have been widely treated 
in the literature /2—5/. 

Following the scheme described in Chapter III (the method of multiple 
maxima), we will find the general solution of the partial differential 
equation (5.10). To this end, we first determine the independent first 
integral of the system 


B= [XH V, Cy —X Vy Cy) (5.11) 
1 
= IV (hy, Vi 4) -¥ (HV, Condh (5.12) 
which has the form 
c= 0+ (KG, h, V)aV, (5.13) 
where 
te mi—Ve , 1 
K (th, Va (5.14) 
Xi2=X (A, V, Cyryo(A, V, t)); (5.15) 
Yy2=Y (A, V, Cy2(h, V, 2)). (5.16) 
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The general solution of equation (5.10) is an arbitrary continuous and 
differentiable function 


g=ald, 4, bE, 8, 4, V)). (5.17) 


Inserting this function in expression (5.8) for R, we find 


R,=9,V sin 6—o {K (t, 4, V)[X (A, V, cy) +g sin é] + 
+ [YOY cn) +(Z—e) cos 0 ]}— P(t, V+ 40 (5,18) 


or 


‘ 1 X\Yo—XoVy 
Ry=9nV sin §—o, VT oa + 


4K (t, h, V)g sind — (-«) cos 0} — f(h, V) +e, (5.19) 
TE 


where V and 6 are related by (5.13) 

Here A, £ are the phase coordinates (the arguments of the function g) and 
V and ® are the control elements (they are related by (5.13)). Using (5.13), 
we can eliminate 6. The function R;for every fixed ¢ will then depend on 
h, & and V, with V acting as a control element. 

Further investigation of the function R, in general form does not achieve 
any useful purpose. One of the ways for further solution is to change over to 
Problem 2 and minimize the functional (5.7) with the following differential 
constraints: 


h=V sing; (5.20) 
co) XY e— Xo i Vee 
ba ST KG Ve sine + (Te ) cos, (5.21) 


where 


b=t—( Kt, h, V)aV. 


As we could have expected, this problem is simpler than the original 
problem since the order of system (5.20), (5.21) is 1 less than the order of 
(5.1) —(5.3). 

Problem 2 can be solved by the method of approximate optimal synthesis 
(see § 2.3). This method can be applied directly to the original problem (if 
the constraints on the phase coordinates are sufficiently simple), but it in- 
volves integration of a system of differential equations of order m+n where 
n is the order of the system of constraints, and m is the degree of the 
approximating polynomial. Therefore, even if the order 7 is lowered by 1 
only, the result is a substantial reduction of the overall computational work. 
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Once Problem 2 has been solved, we have to check the applicability of 
conditions (5.4), (5.5) and establish that the solution indeed belongs to the 
angle w(t). The latter is verified by inserting the solution hA(f), V(t), 6(£) in 
the equation 


= [mi tator +(e) cos | (5.22) 


which is then solved for v. If all the above conditions are satisfied, the 
solution of Problem 2 defines the zero closeness function of the sliding 
control. The original problem is thus solved. Otherwise, as we have seen 
before, the function 9 is constructed by a different method. Although in 

this case the solution of Problem 2 does not solve Problem 1, it nevertheless 
provides some information concerning the type of the sought solution. Using 
this information, we can construct a reference solution which is subsequently 
improved by some scheme of successive approximations, e.g., by the 
method of approximate optimal synthesis for the original problem. A 

clever choice of the reference trajectory reduces the region in which the 
optimal synthesis is constructed and thus lowers the degree of the 
approximating polynomial. As before, this leads to a marked reduction in 
the volume of computations. 


5.1.3. A special case of a symmetric constraint on 
the lift coefficient 


Let 
leat, 4, Vyl=leyo(t, 2, Vil =cytim(?, 4, V). (5.23) 
In this important case, the analysis of our general problem can be 


continued much further. Since the function X(h, V, cy) is symmetrical, we 
have 


X (A, V, Cy)=X (h, V, Cy), 


and condition (5.10) takes the form @, =0. This means that the function » 
is independent of @. The function A, takes the form 


Ri=F1,V sin 8 — py [|X (A, V, Cyrim(t, 4, V))+ gsind}]— f(t, V) +o. (5.24) 
It follows from the structure of 9; and Ri that sin @ is the control element 
in this case: it enters the expression for Ri inlinear form. Problem 2, in 
its turn, is degenerate and it can be solved by the method of multiple 


maxima. We define 9;(4 hk, V) so that the coefficient before sin 8 in (5.24) 
identically vanishes, 


Pav — %yg=0. (5.25) 
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The general solution of the partial differential equation (5.25) is an 
arbitrary continuous and differentiable function 


= (Ft, 1(4, V)), (5,26) 
where 
y2 
se (5.27) 


Ohi 
at 
(5,28) 
OV 
a = 
Inserting this solution in the expression for R;, we find 
R= — 2X (h, V, Cyim(, h, V))— F(A, V)+ 90 (5.29) 


where h, V, and n satisfy (5.27). In (5.29), nis the phase coordinate, and 
fh and V are the control elements (they are related to yn by (5.27)). We can 
change over to one independent control element, / say, by expressing V 
from (5.27) in terms of 9 and A: 


=YV 2g(n—A). 


As in the general case, it is better at this stage to pass to a modified 
problem (we call it Problem 3) minimizing the functional (5.7) with a single 
differential constraint 


= —— XU, Vs Catim (ts A, VY) (5.30) 


where h, V, 6 satisfy (5.27). Problem 3 can be solved by approximate 
optimal synthesis, which in the present case virtually reduces to integrating 
a system of first-order ordinary differential equations where the number of 
equations is equal to the degree of the approximating polynomial plus 1 
(one-dimensional synthesis). 

The resulting solution (A(t), V(t) of Problem 3 should be inserted in the 
equations of Problem 2 in order to obtain sin 8. If the result satisfies the 
constraints, (A(t), V(t), a(t )) is a solution of Problem 2. We treat this 
solution as before. 

In practice, the constraints on cy and the phase coordinates in problems 
of this kind are generally determined by physical constraints on overloading, 
temperature, etc., which depend on the flight altitude and velocity and are 
independent of time. In numerous problems the final time ¢; is not given 
(e.g., the problem of the minimum added heat in the critical zone, which is 
considered below). In this case, the right-hand side of equation (5.30) is 
time-independent. Using this fact and further remembering that any 
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physically admissible motion of the aircraft with the engine cut off, de- 
scribed by equations (5.1)—(5.3), is constrained by the inequality n<0 

(the total energy of the aircraft cannot increase with time), we may change 
over from ¢ to a new independent variable 7, without distorting the physics 
of the problem. Problem 3 is thus finally reduced to the problem of 
minimizing the functional 


™ 
—— pant F (Ag, Vox 80s Bir Vay 91)s (5.31) 
oe X (hk, V, cy) 


To 
where 


without differential constraints and with inequality constraints of the form 
(5.5) imposed on the free functions f(y), ¢cy(n) and on the parameters 05 

and 9;; in other words, the general problem is reduced to an elementary 
problem of the absolute minimum of a function of a single variable for every 
fixed n. Let f? (ny, 4, cy) denote the integrand in (5.31), The solution of 
Problem 3 then coincides with the solution of the equations 


Fi(n, A(M), cy(A, 0) = ink Ai(A, fy ey), TE (My, No); (5.32) 
hEB(n) 
l¢yl < ¢ylim(4. 


F(h, Vo (Ro, Nb), 60, ii, V, (hy, 1), 6)= 


= inf (A, Vo (Ko; No) 6; h, V(Ay, 1), 6,), (5.33) 
(Ao, 80) € Bo 
(41,63) € Br 


where B(n), Bo, Bj are the regions of the admissible values of h, 00, 9: defined 
by conditions (5.5), or the regions of those A, 69, 6; which are actually 
attainable for system (5.1) —(5.3) under conditions (5.4), (5.5). 

Although the solution of Problem 3 is elementary, we have to consider 
its existence in the set D. The structure of the functional in Problem 3 is 
so simple that it permits drawing some important conclusions concerning 
the feasibility of the solution of this problem. Indeed, let us find the 
maximum of the integrand in (5.31) with respect to cy for any admissible 
fixed ) and h. Note that the corresponding optimal value c’, (n, 4) depends 
only on the sign of the function f? (A, y, cy). If f? <0, there is only one 
optimal value of Cy, c =0. If f9>0, the minimum of [9 is attained for two 
different values of ¢y, 


cf =beyiim (A, 0). 


In the negative case, the substitution Cy=cy in the original system deter- 
mines, in combination with the initial conditions, the unique solution of 
system (5.1)—(5.3), i.e., a ballistic trajectory which does not necessarily 
coincide with the solution (A(y), V(n), 6(n)) obtained under completely 
different conditions. 
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In the positive case, the solution of Problem 3 does not necessarily 
satisfy equations (5.1) —(5.3) for Cy=Cy,. It only has to satisfy the constraints 
onsin 6 andtolie within the angle w constructed at each point of the 
trajectory. In this case the solution can be realized by a sliding program — 
a succession of control functions, each comprising alternating flight sections 
with maximum and minimum angle of attack. 

The denominator of the integrand in (5.31) for any values of the arguments 
is definite positive, so that the sign of the fraction f{ coincides with the sign 
of the numerator, i.e., it is determined entirely by the particular form of 
the function f°(h, V) (the given optimality criterion). Thus, if f(A, V)=1 
(the problem of minimum descent time), the solution of Problem 3 is indeed 
feasible. 

If, on the other hand, fo(h, V) =—1 (the problem of maximum lifetime), 
the solution of Problem 3, strictly speaking, does not exist in D and an 
alternative method has to be used. 

If fo(A, V) changes its sign, the solution of Problem 3 is strictly feasible 
in that region of the (4, V) space where /°(h, V)>>0. 


5.1.4. Generalization to the case of additional 
control elements 


The preceding results are generalized without much difficulty to the case 
when the angle of attack cy is not the only control element: other common 
control elements are the angle of bank y and the drag coefficient Cx9. In 
practice, Cxo can be controlled with the aid of braking apparatus in the form 
of skirts, flaps or air brakes, and parachutes. 

Let us consider the previous problem of minimizing the functional (5.7) 
over the set D of flight programs satisfying the following conditions. 

1. The equations of motion 


h=V sin6; (5,34) 
V=—X(h, V, Cx, cy) —g sind; (5,35) 
pose Ware 
b=2 [y (a, V, ay)eos y+ (— g cos], (5.36) 


where &f, V, @ are the phase coordinates; Cy, y, €xo are the control elements. 
2. Boundary conditions (5.5), with free end time /¢;. 
3. Constraints, for ¢t € (0, 41), 


(h, V, 8) CB; 
(Cx, Cy) CQ (A, V). (5,37) 


The angle of bank y is either not constrained (—I<cosy<+1), i.e., 
equations (5.34) —(5.36) are in fact the projections of the three-dimensional 
equations of motion on the vertical plane, or takes on the values 0 and x 
(cosy=-++l)only, i.e., the aircraft is truly constrained to perform two- 
dimensional (or almost two-dimensional) motion. 
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We will accept all the assumptions listed in 5.1.1. 

Note that owing to the presence of the additional control element y, we 
may treat the product Y(A, V, cy)cosy as an independent linear control 
element Y, which assumes values inside the interval 

= LL 
[ Yam’ 1 Yasim 
where Y,,;,, may depend on fh and V. 
The function R takes the form 


R= sind— oy [X (hy, Vs Crosty Je sino] +9 5 | Ya + (—e) cos | +o. (5.38) 


Successively applying the method of multiple maxima and changing over 
to a new argument, we end up, as before, with Problem 3 minimizing the 
functional 


( f0Uh, VV) 


= i AN+F (to, Vor 95 M1, Vi, 94), (5.39) 
eX ltr Vieeys Cu) 
No 
where 
V=YV 2g (1A). 


For every 1€ (1, No), the solution 2(n), @x0(n), y(n) of Problem 3 is de- 
fined by the condition 


Fi(M, A(M), Cx,(M), Cy(M)) = ink /T(N, A, Cx,, Cys (5.40) 
AEB, (4) 
(4x04 €y) € Qa (A 8) 


where f° (n, A, Cxo, Cy) is the integrand in (5.39). In other words, for every 
n€ (m1,m0), the values of h,éx0,@ are obtained as the point of absolute 
minimum of the function [® (n, A, cx, cy) with respect to all the three in- 
dependent variables h, Cxo, Cy. 

Owing to the additional control element y, the transition to Problem 3 
does not require any assumptions concerning equal limits of variation of 
the angle of attack. In general, the solution of Problem 3 exists in D 
when é,(n) is unique, provided that é,(n) 40. - 

Indeed, inserting h, éy, xo in the initial conditions, we find 6(n) and 
then some function Y, (n) and the corresponding "required" value (cos y)r. 

If |(cos y),|<1 and the solution (%(n), 6(n)) satisfies the boundary conditions, 
we have solved the original problem. If the angle y is not constrained, 

we may take cosy = (cosy);- If cosy is constrained to the two values + 1, 
h(n), 8(n) may be approximated in D by constructing a sliding program with 
the basis control functions (é:9(y), @(m), cosyi,2), where cos y;=— 1,c0s ye=+1, 
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§5.2. OPTIMAL DESCENT PROGRAM OF AN AIRCRAFT 
IN THE ATMOSPHERE BASED ON THE CONDITION 
OF MINIMUM ADDED INTEGRAL HEAT IN THE 
CRITICAL ZONE 


5.2.1. Statement of the problem 


Consider the problem of aircraft descent in the atmosphere with minimum 
added heat in the critical zone. The added heat is estimated by the 
functional 


4, 
kee V3.5 at, (5,41) 


where K, is a constant. 
The admissible motion programs satisfy the following conditions. 


The equations of motion 


h=V sind; (5.42) 
V=—X(h, V, cy)—g sind; (5.43) 
‘ 1 y2 

a= Bralcescer —g)coss], (5.44) 


where 
Y,=Y (h, V, cy) cos y. 


Boundary conditions: 
Alternative 1. Only the initial and the final energy levels are given: 


#=0, 1(0)= Nor 
f=4, 10) =% rs) 


(transition from one energy level to another). 
Alternative 2. The initial values of the phase coordinates and the 


final value of the energy level are given: 
t=0, A(O)=Ao V(Q)=Vor 9(0)=% U=TMe- (5.46) 


In both alternatives, ¢, is free. 
The sought optimal program is expected to satisfy the following 
constraints; 


1) ey<eyiims (5.47) 
2) AD Mimi (5.48) 
3) [N(A,V, or= [X2(h, V, ey) +Y2(h, V, cy] < N23 (5.49) 
4) 9, (h, Vy Cy) <4. tim: (5.50) 
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Inequality (5.47) is an engineering constraint on the lift coefficient; ¢y1im 
may be identified with cynit, say. In (5.48) hiim is the geometrical altitude 
limit, i,e., the ground level. Relations (5.49), (5.50) impose constraints 
on the total overload NM and the heat flux g, at some critical point on the 
aircraft surface. The limit values of these parameters are fixed from 
structural strength considerations and from safety requirements for the 
crew and the instrumentation. It is assumed that for any fixed handV, 

q. (4, V, cy) is a symmetrical and monotonic (on both sides of the origin) 
function of Cy. 

From considerations of problem solution, constrainsts (5.49), (5.50) 
are better written in the form (5.5). Note that both N® and gq; for any fixed 
h, V have a minimum for c,y=0. 

Condition (5.49) is then equivalent to the following two inequalities: 


(N),9 < N23 (5.51) 
lev < oi (5. 52) 


where c(h, V)is the positive solution of the equation 


N2(h, V, cy) =N? 


hs (5.53) 
for any fixed fh and V which satisfy (5.51). Clearly, for other h, V, the 
solution Cj, does not exist. 

Condition (5.50) is similarly equivalent to the following inequalities: 


(Vey=0 < % rind (5,54) 
fy <c7" (A, V), (5.55) 

where ef (A, V)is the positive solution of the equation 
q: (A, V, Cy) = 4.1m: (5.56) 


The functions (N?(A, V) )eyro and (9,(h, V))cy~o decrease monotonically 
with increasing altitude and increase monotonically with increasing velocity. 
Therefore (5.51) may be written in the form 


A>h(V), (5.57) 


9t=Qtlimcyeo 
N= (Mim) ey «0 


V=const 


FIGURE 5.1 
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where A*(V) is the solution of the equation 


(N?(h, V))cyao =NI (5.58) 
for any fixed V, and (5.54) may be written in the form 
AD>K*(V), (5.59) 
where h**(V) is the solution of the equation 
(9, (4, V)cy-0 = %e.1im (5.60) 


for any fixed V. 
Summing up, we can write the various constraints in the following form 


(Figure 5,1): 


iim. for Ay, >A", Ae 


ADh(WV)=\n"(V) for A’ > hyn, A (5.61) 
Ae(V) for AMS Ay, h* 
yim = for tyim<ej, 6)" 
Icy < cyp(4, V) = hey (4, V) for ch <eyiim, ¢;° (5.62) 
ey(h,V) for cl < ¢yiim, ey 
—Y (A, V, Cyn) <Y,<Y (A, Vs Cyp), (5.63) 


and in this form they will be used in what follows. 

Note that the physical meaning of the constraints and the corresponding 
classification are of no consequence from the point of view of the method 
of problem solution. We are merely concerned with the formal classification 
of constraints, which distinguishes between constraints on the phase 
coordinates (group (5.61)) and constraints on the control elements depending 
on the phase coordinates (group (5.62)). 

We can now present a rigorous formulation of the problem, denoting by 
D the set of admissible motion programs satisfying the above conditions: 

Find a sequence of programs {(fs(t), Vs(t), Os(t), Cys(t), ys(t))}GD on 
which the functional (5.41) goes to its least value over the set D. 


5.2.2. Solution of the problem 


Seeing that the time ¢, is free and constraints (5.61) —(5.63) do not depend 
explicitly on time, we will solve Problem 3 corresponding to the original 
problem. We thus minimize the functional 


1 
0° 5 15 


f=—K, an, (5.64) 


4 
Ps X(h, V, cy) 
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where 
V=V2¢(0—A) 


under constraints (5.61), (5.62). The minimum of the functional (5.64) 
yields a certain solution %(n), @,(m) on which, for every fixed n € (m1, to), 
the integrand in (5.64) attains its minimum under the same constraints. 

Since in our case f°(h, V)>0, the minimum of the integrand for any 
fixed yn and A is attained for |cy|=cyp (4, n). We now insert |cy|=cyp in 
(5.64) and find the minimum of the function 


oe 5y3- 5 


(0, A)= (5.65) 


—X(hV, h, 
z ( Cyp(A.n) 


on the set hA>hp(n). Here hp(n) is obtained by 
qaconst V4 transforming /}(V) to the new coordinates (see 
Figure 5,1). 

Nv Figure 5.2 shows some typical curves of the 
NL function x(h) for several fixed values of n. The 
minimum of (A) is attained at the kink point op, 
which is precisely the point of transition from 
FIGURE 5.2 one Cy, Cylim, to another ¢} or c%}. In other words, 

this is the point where 


ce te 


N(A, 4, Cytim )=Niim 
or 2 
9(h, 0, Cytim )=41im- 


The solution of Problem 3 for the two alternative sets of boundary 
conditions thus has the following form (Figure 5.3). 
Alternative 1. 


Fn = {hen for Aopt(1) > Atim 
yim f0r opt < Mim: (5.66) 


nClm» ol 


oO F’ 
1 Jo 7 


FIGURE 5.3 
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Alternative 2. 


h=foe for N=N 
= fhopt (A) for opt (> him. 4 E(m, Np) (5.67) 
Aim for hopt<hiim. 


For both alternatives, 
leyl=Cy iim: 
We will now consider the existence of the solution of Problem 3 in D, 


i.e., the actual construction of the sought minimizing solution. 
In equations (5.42) —(5.44) we change over to a new argument 1: 


ro esing in 9, c,): 5.68 
. X (hk, V, cy) FM fe 8 eu) ee 

e| ¥a+ (a) eos | 
Pe 7 eee A, 6, a 5.69 
e V2X (h, V, Cy) Pn, °v) ( ) 


_ The solution of Problem 3 is feasible in D only if the direction (1, 6’(n), 
A(y)) in the (A, 8, n) space, for every , lies within the angle w spanned by 
the vectors 


a,=([1, J, i, 6, Cytim)] 
and 


ao =(1, iM, h, 6, —ty), 
where f=(f1, f?). 


In other words, the substitution of 6’ (nm) in the equation 
0 (n= vf? (1, BB, ea +U—Y L200, hy + Gnind (5.70) 


should give v which satisfies the constraints 0<v<1. 

At the same time, the functions %(n), 6(n) may be discontinuous. The 
values of %’, 0’ at the discontinuity points are not bounded, and the direction 
[!, %’, 6] therefore a priori does not lie within the angle w since the functions 
f2(h, 8, 4, Cyn) are bounded. In this sense, the solution of Problem 3 does 
not solve at the same time the original problem. However, using this 
solution as a guide line, we can construct some program A(n), 6(n), Gy (n) 
from D which can be regarded as a potential optimal program. 

Note that under the above constraints, the condition 0<v<1 is generally 
satisfied on the continuous sections (%(n), 6(n)) for a wide range of n, 
with the exception of very small n, when motion along Ropr(n) requires 
smaller values of Y, than the Ya,,_, available. 

A potential optimal program constructed in this way is shown in 
general outline in Figure 5.3. It comprises flight sections corresponding 
to the limiting values of Y, and sliding control sections, with the control 
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switching between the two limit values of Y, along the (ny) line. In 
particular cases, some of the sections shown in Figure 5.3 may be missing. 


5.2.3. Estimates of the solution 


Since the solution A(n), 6(n),¢y(n) constructed above is only a potential 
optimal solution, it requires further verification. One of the possible 
techniques is to verify the sufficient conditions of a local minimum /1/. 
Another approach is to improve the solution by successive approximations 
(e.g., by approximate optimal synthesis), using the available solution as 
the first approximation (a reference solution), 

We propose to check the potential optimal solution by comparison with 
another solution (A(n), @(y), cy(m))- which does not necessarily belong to D 
but on which the functional is a priori known to attain a value not exceeding 
its exact lower bound on D. 

Let 


(4 (n), 8 (), Cy (n)) =2. 
Then 


1 (2) <int/. (5.71) 


By (5.71) we clearly have 


[al =/@)— int I< @)— 1 (2d 


Thus A/, provides an estimate of the solution, and the accuracy of this 
estimate clearly depends on the particular function 2. chosen. 

We can choose 2,, say, as the solution of Problem 3. For boundary 
conditions in Alternative 1, this estimate is expected to be fairly accurate. 
For boundary conditions in Alternative 2, on the other hand, this estimate 
may prove to be too crude, since in this case the potential minimizing 
solution may markedly differ from z, over some long sections (see 
Figure 5.3). 

A more exact estimate can be obtained, but this requires some additional 
construction. 

Using initial conditions (5.46), let us ponsider the solution of the system 


of six equations (with the argument Q= fo°5V2'8dt) 
3 


as: V sin@ 
Ay=min ( go sya.i5 ) > 


ey78 
: 7 vX(V,h, cy) 
=min{—— ——>24;—}); 
1 min ( zg eo 5y3.15 ) 
V2 
1 ¥e+(—~—e) cose (5.72) 
6 = min | — ———-ZE_~ ___ }. 
Ly veya V 9053-18 ’ 
, V sind 
= max {| ——__. 
U max ( sas) 


Pe ee ee 


The minima and the maxima of the corresponding functions are located 
for every fixed @ in the region 


—Cyrinkt, N) Sly KS Cyind’, 0); 

—Y (A, 0, CH in) <Y,<Y (4, 1, Cin 
h(Q) <A <Ay(Q); 
11(Q) <1 < My (Q); 
91(Q) < 6 < 4(Q). 


(5.73) 


In virtue of the right-hand sides of equations (5.72) (see Appendix), ,(@) 
is some lower limit value of yn (which lies below the exact lower limit of n 
for the original system) for every fixed Q, and its inverse function is some 
lower limit of Q for fixed n (Figure 5.4). We continue the solution of this 
system to its intersection in the plane (fh ,, n,) with the line k(n),1. The 
quantities relating to the intersection point are denoted with an asterisk *, 
A similar construction can be carried out at the left end point of the interval 
(m1, No) until 4, (n,) intersects %(n) at the point n** 


Exact limit 


7L(@) 


g 
FIGURE 5.4 
I, is taken in the form 
yi 00:5/73.15 
I,=K,{ aQ*+aQh+ Wt a (5.74) 
= X (A, Viewind 
& 
The estimate is obtained as follows: 
Alp=|l-—/ (2) | (5.75) 
or 
y emaLe (5.75a) 
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5.2.4. Numerical example 


As an example, we computed the optimal descent of a hypothetical air- 
craft with the following specifications: 


weight G= 5000 km, wing load c= 153kg/m?2, 


The polar curve of the aircraft is 


c,=0.1+ 1.83 |sin* al; (5.76) 
Cy=+ 1.83 sin?a cosa. 


(Expressions (5.76) are borrowed from /6/. ) 


0 +200 tr 300 400 500 ty t 
__ ae . 30040050 tr t 


ag 
| be i [_-+--+- 


2-109! 2-10" 


Benen. 
700 200 300 400 500 t sec 
Q=Q75y es kg 0.5 m5 sec7?-25 arf gat kgs m!#35 seg 72615 
) 


FIGURE 5.5 
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The following boundary conditions and constraints were used: 


N= 2.93-10°m; No= 80-10? m; 6)=0; 
1, = 15-10% m; (5.77) 
Nin=5; 2 Cyrim= 0-645. 


No constraints were imposed on the heat flux. The density of the atmo- 
sphere was taken in accordance with the SA-64 standard atmosphere model. 


FIGURE 5.6 


The computations were carried out on the BESM-2 computer. Equations 
(5.42) —(5.44) were integrated in original form with the argument f. 

The results are plotted in Figures 5.5 and 5.6. 

Figure 5.5 shows the time variation of the phase coordinates h, V, @ and 
the variables Q, Q for Mj,=5,2. Figure 5.6 shows the auxiliary character- 
istics of the optimal program: the trajectory in the coordinates h, x, where 
x is the horizontal range, and fh, yn. Figure 5.6 also gives the function 
h(t). The following estimates are obtained for this program. 

Alternative 1. 


Alternative 2. 


N,=5; A=4.1%: 


lim 


Mim=2; A=0.6%. 


5.2.5. Practical realization of the optimal program 


In the particular problem considered in this section, the limits for the 
variation of the angle of attack are assumed to be equal in absolute value. 
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The sought solution in this case can be realized in more than one way. Let 
us consider in some detail two realizations of the optimal program, 

According to one technique, the angle of attack is controlled by altering 
the angle of pitch while maintaining a constant angle of bank y=Oor y=n. 
The sections with Y,=—Yjmor Ya=Yjimmay be attained by taking cy= —cy);5 
or Cy=Cylim respectively. Sections with —Yin<Y¥.<Yjim in this case are 
sliding control sections and are attained by switching between the two limit 
values of the angle of attack. 

According to another technique, the banking of the aircraft is controlled 
so that cy=cy|jm Or Cy=—Cylim. A number of alternative versions of this 
technique may be considered, corresponding to different programs of al 
altering the angle of bank. All these programs are characterized by the 
same 'mean" projection of the lift on the vertical plane, which is equal to 
Y., whereas the ''mean" projection of the lift on the horizontal plane is 
variable. One of the possible programs may maintain the instantaneous 
angle of bank in accordance with the relation 

cosy=—4, (5.78) 
lim 
another program may rotate the aircraft with a variable angular banking 
velocity, a third program may induce oscillatory movement of the aircraft 
in a certain range of banking angles, etc. This freedom of banking control 
creates wide possibilities for lateral maneuvers. 

With the first control technique, the sliding 
control section is realized by switching over 
between the two limit angles of attack in flight. 
An elementary switching cycle is shown in 
Figure 5.7. To assess the departure of the real 
values of the functional from its lower bound, 
let us consider the expression of the functional 
for cy=Cyb(#,n) The integrand x(A, n) for 
every fixed n depends only on the altitude h. 
The deviations of the functional from its 
minimum are determined by the altitude 


Y-Y, 


0 


deviations 
; | At Ah=h—h 
‘ ” 7 (n) 
2 for all 
FIGURE 5.7 nE(n1, No). 


Let us estimate the magnitude of these deviations 
for different number of cycles. To this end, we introduce the following 
assumptions. In the range of altitudes corresponding to the permissible 
deviations Ah, the velocity V and the density 9 are taken equal to the optimal 
values V(n), o(n); cos@<1;sin@é~@. Then in the relevant range of altitudes, 
for fixed n, 
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We further assume an exponential density distribution in the atmosphere, 
e=ae P86 1.31-10-* m7, 
Under these assumptions, the equations of the relative deviations 
Ah=h—h, A@=0—6 take the form 
(Ah)= Vas; 


a (5.79) 
(48) =a(EY—Y,), 


where 
Y,=W(h, Vy Cyn) + Uw ¥ (th, V—Cy,,,)- 


The change in velocity during a single cycle is ignored. 
The integrand x(n, 4) in (5.65) may be replaced in the relevant neighbor- 


hood of % by the expression 
x(N, A)=x+x,dh, AA>O, (5.80) 
where, as is readily seen, 


n= (V2); 


£(V2)—= —al—<; a=0,5;6=0.007. (5.81) 


Hence we easily find the relative deviation of the functional from its 
minimum over a cycle 


T/2 
ame J AA(t) dt. (5.82) 


Let us evaluate the integral in (5.82) using equations (5.79). Integration 
f (5.79) over At (see Figure 5.7) gives 


ah=(¥—¥,) 2 O.<atc<t, 
g . 
dha (V~Y 4+ —¥,)t (at) SP ‘ 
where 
foi Ne 
te 


Inserting the expressions for Ah in (5.82) and integrating, we obtain 


wapm(BY (S v— ¥,) (74) 4 


Al 
+0 1) (8) (AY -fe tea(Gy+ 
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Hy rk ($8). 


Tc is related to the number of switchings by the equality 


bh, aint 
cu" 2N ’ To” 


The total deviation 6/ is thus expressed by the relation 


“ ive 
Yar), as af \ (AL) 
Ls Caer (7) ane 5.83 
T oar 7 = ; Al t t ( * ) 
a1 
T 
8 
6 
4 
2 
0 
FIGURE 5.8 


The dependence of 6/// on T; is shown in Figure 5.8. It is readily seen 
that a program with T,= 30secis already sufficient to provide an adequate 
approximation to the optimum. The transition time between the two limit 
values of cy apparently can be ignored in comparison with 7,. 

The deviation of the functional from the true minimum in the second 
technique depends on the particular banking control program. It seems 
that, irrespective of the particular program, banking control invariably 
gives a better realization of the optimum for the same number of cycles, 
since in this case |cy|= c,jjmat any time. The program fixing the angle of 
bank in accordance with (5.78) attains the exact lower bound of the functional. 


5.2.6. Remarks concerning a controlled 
drag coefficient 


We will now see how the solution of the problem is affected if Czy is 
controllable within certain limits 


Cxo min SCxo SCx0 max: 


which are either imposed by engineering constraints or are functions of 
h, V, and cy, as it follows from the condition 


N(h, V, exo, Cy) <Miim (5.84) 


(the constraint on the heat flux is disregarded). 
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Condition (5.84) is better written as a system of two constraints 


N(h, V, Cys 0) < Mim; (5.85) 
Cy (C0, Cy) Se pinhA, V), (5.86) 


where Cxiim(#, V) is the solution of the equation 
N2(A, Vy ¢, (Co. Cy))= Nim: (5.87) 


Let us investigate the integrand in (5.64) under these constraints, taking 
Cxo to be variable. Since f/°>0, the minimum of the integrand with respect to 
Cxo, Cy corresponds to the maximum with respect to Cxo, cy of the function 


X(h, V, €x(€x0, Cy)), 


which, for any fixed h, yn, is attained on the maximum value of ¢x(Cxo, Cy), 
i.e., 


Cx(Cxo, Cy) =Cx1im(h, N)- 


Equation (5.87) may be written in the form 


cle, Cy) ee = Se eee (5.87a) 


The function Cx(Cxo, Cy) attains its maximum, under constraint (5.87a), 
for Cyo=Cxomax(h, V), where Cxomax is either the solution of equation (5.84) 
for any fixed A, V (in this case c,=0) or the upper engineering limit 
Cxomaxf (in this case |cy] is obtained from (5.87a)). 

Inserting the results for Cxo and cy in the integrand in (5.64), we find that 
the function x(n, 4) attains its minimum on h=hk(n), which is determined from 
the condition 


Nh, Vy Ceo maxt> Cvtim)= Mim (5.88) 
We thus see that the optimal programs with controlled Cxo are essentially 


the same as with a fixed Cxo. If the fixed Cxp coincides with Cxomm for the case 
of controlled Cxo, the altitude h(n) is evidently greater than with fixed Cxo. 


Appendix 


Estimating the limits of the admissible phase 
coordinates for systems of differential equations 


Consider a system of differential equations 


yf yy. . 6, ya) 
Y=, yy. . ., 9%, 4) (5.89) 
y=fr(t, y', y’, eo 8 ey y", u), 


with given initial conditions 
y(O)=yd, ¥(O)=ys,- . -, y(0)= yb. (5.90) 


The functions fé (t, y', y?, ..., y”, uw) are assumed continuous and differen- 
tiable in all their arguments, u(t) is an r-dimensional vector function (control 
function) whose values lie in some region Q(t y', y*, ..., y") in the 
r -dimensional space. 

We use (5.89) to form a new system of 2n differential equations 


y= ing 1G, ys gy... yt, as 
he ee ee 
gu= sup, F(A y's Wee yt 
wry... ya 
i inf Fb, y}, Yyesiey y", u); (5.91) 
yy. yan 
y= sup f(t, y', yo... ys 
yryyt,.e ote 


The maxima and the minima of the corresponding functions lie in the 
region 


yu) <9 < yyld); 
OSE sat | (5.92) 


y(t) <y" <yf le). 


It can be shown that the functions Uh (t), y, (t) corresponding to the 
solution of system (5.91) with initial eonditions (5.90) constitute certain 
upper and lower limits (not necessarily exact) of the functions y! (f) 
corresponding to all the admissible solutions of system (5.89) with the 
same initial conditions. 

We will first examine the question of limits for the case of one first- 
order differential equation. The following proposition holds true. 
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Lemma. Consider a first-order differential equation 


g=f (t, y, 2), (5.93) 


where u is an r-dimensional vector function (a control function) taking its 
values in some region Q(t, y) of the r-dimensional space. The function 
f(t y, u) is assumed to be continuous and differentiable in allits arguments. 
The initial condition y(0)=y) is given. The solution y,(t) of the equation 


YL pint os 4» 4) (5,94) 
with this initial condition provides the exact lower limit for all the ad- 
missible values of y and every fixed ¢t €[0, 4] for the solutions of equation 
(5.93) with the same initial condition. 

Indeed, the necessary conditions for a minimum of y(¢,)=y, in the 
Mayer problem for equation (5.93) are 


Ry=0-9=—Hfy, Y=, 4); (5.95) 
R(t, yy, a= sup R(t, yy, 4); (5.96) 
ueQ(t,y) 
®,,=0=> p(t,)= —1, (5.97) 
where 
RA y D=wS (ty, +95 (5.98) 
O4,, w=wA+eh, m1); (5.99) 


p(t, y) is an arbitrary continuous and differentiable function. 

It follows from (5.95), (5.97) that p(t)<0 everywhere on (0, t;), being 
a solution of a homogeneous linear equation which is negative at least at 
one point, But then (5.96) is equivalent to (5.95), which together with the 
initial condition y(0)=yp) defines a unique solution of the equation. Since 
(5.95) —(5.97) are necessary conditions, the unique solution is of necessity 
optimal. Thus y,(¢:) is the absolute minimum or the exact lower limit of 
y(t) for equation (5.93) with the given initial condition. Since ¢,;>0 is 
arbitrary time, y,(¢t) is the exact lower limit of all the admissible values 
of y for any f¢. 

It is similarly proved that the solution of the equation 


y= sup f(t, y, 4) (5,100) 
u€Q(t,y) 


with the same conditions is the exact upper bound of y for every fixed 
t>0. 

Let us now return to systems (5.89), (5.91). If each equation from 
(5.89), say the equation for y!, is considered independently of the other 
equations, it has the same form as equation (5.93), with u(t) and the phase 
coordinates except y! treated as the control functions. The first and the 
second equation in (5.91) have the form of equations (5.94) and (5.100) in 
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relation to (5.93) and they therefore determine the upper and the lower 
limit of y! if the region of the values of y! where the maximum and the 
minimum are sought coincides with the region of the admissible values of 
y' or encloses it. 

The initial region of the coordinates is given: in our case this is the 
point y/(0)=y!. At every succeeding time, equations (5.91) define the 
limits of the admissible values of y', so that (5.92) define the region of the 
admissible values of the variables at any time, etc. 

Hence it follows that the solution of system (5.91) defines the limits of 
the components of the solution of system (5.89) at any time. 
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Chapter VI 


NECESSARY AND SUFFICIENT OPTIMUM 
CONDITIONS FOR DISCRETE CONTROLLED 
SYSTEMS 


§6.1. STATEMENT OF THE PROBLEM 


Consider two quite general sets Y and U with elements y and u, respec- 
tively, and a finite natural sequence A=(0, 1, 2,..., N). Toevery iCA 
corresponds a subset V(i) of the direct product YXU. : 

We further introduce a set D of the pairs y(i), u(i)of functions of the 
integral argument i defined over A, such that for all iC A 


(y¥@, #)EeV(); i=0, 1,..., N; (6.1) 
g@+)=f li, 9, 4h i=0, 1,..., N—-1, (6.2) 


where the function (operator) f(i, y, u) is defined over the direct product 
AXYXU, mapping it onto the set Y. It is assumed that D is nonempty. 
An element y is said to be a state of the system or its phase state; uis the 
control. The former differs from the latter in that it enters the constraint 
equations (6.2) with different values of i. 

We define a functional on D, 


N 
1=S PL yO, a), (6.3) 
feo 
where fo(i, y, u) is a functional defined on AXYXU. Let the functional / be 
bounded from below on D, i.e., 


inf /=d >—oo, 
D 


We seek a sequence {ijs({), zs(i)} ( Dwhich minimizes the functional / 
over D, i.e., a sequence such that /[¥s(i), ws(i)]—-d for S—.co. In partic- 
ular, if there exists an element (7(i), a&(i)) CD satisfying the equality 
I{¥(i), u(i)}=d (this element is called the absolute minimizing solution), the 
problem reduces to the determination of this element. 
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§ 6.2. THE OPTIMUM PRINCIPLE 


We define an arbitrary functional g(i, y) on the direct product AXY and 
use it in the following constructions: 


Ray, \=9lE+1, Fi, y, al—e, y— SG, y, 4); (6.4) 
piles supe RU, y, 4); (6.5) 
Po(y, u)= — (1, F (0, yY, u)|+ f° (0, y, u); (6.6) 
Di(y, Y=o(N, Y+H(N, y, 4); 
mo= inf (yy, w); m= inf O,(y, 2). (6.7) 
(y,u)EV(0) (y,wyEV(N) 


We will prove the following theorem. 

Theorem 6.1. (Optimum principle), Consider a sequence 
{¥s(t), Us(i)} C D. For this sequence to minimize the functional / over D it 
is sufficient, and if for all iC Athe functional f°(i, y, uw) is bounded on V (i) for 
every i€ A, it is also necessary that there exists a functional (i, y) satis- 
fying the following conditions: 


1) a function p(i) expressed by (6.5) is defined on {1, 2, ..., N—1}; 
2) for all i=1, 2, ..., N—I, 
Rli, is, Hs D> |W, S—+ co; (6.8) 
3) 7 Polys(0), us (0)] — Mo, (6.9) 
®,[ys(N), 4s (Nm, S > 00. 


Remark. If the sequence introduced in the theorem has the form 
¥s(i) =y (i), ws(i) =U(i)for all S, the condition of convergence in (6.8), (6.9) 
is replaced by equality and the pair y(i), w(i) € D satisfying the conditions 
of the theorem is the absolute minimizing solution. 

Sufficiency. We will use the lemma from $1.2. The set M of this 
lemma is identified withD, and the set N is identified with a new set E which 
satisfies all the conditions of D except equalities (6.2). On this set we define 
a functional 


N-1 


Lly 4), 4I=%o(y(0), 2) + Mi (y(N), a(N)— FRE yO, a). (6-10) 
fol 


For y(i), u(i) CD, L=/. This is quite obvious if we rewrite (6.10) in 
the form 


N 
Lily, x=! + D elé, yO— eli, FE-1), YE), aE—D). (6-11) 
tm] 


Suppose that there exists a functional @(i, y) such that the conditions of 
the theorem are satisfied on some sequence {yg(i), us({i)} © D. This sequence 
then minimizes the functional Z on E and, in virtue of the lemma, it also 
minimizes the functional / on D. The second part of the theorem — 

— necessity — will be proved in § 6.5. 
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The present theorem generalizes to the discrete case the sufficient 
optimum conditions formulated for continuous processes. If the conditions 
of §1.1—§1.2 are imposed on the sets Y and U and on the functional » and 
constraint (6.2) is represented as a difference scheme 


yé+1)—y@=aF li, y@, 2), 


where A is a positive number, we readily see that the functions R and ® of 
the theorem coincide with the corresponding functions of §1.2, apart from 
a factor A and terms of higher order in A. It is significant that the 
transition to an integral argument made it possible to reduce to a minimum 
the various mathematical concepts and constraints required for formulating 
the result. The problem is now stated in terms of general sets, operators 
defined on these sets, and functionals. 

The optimum principle formulated in this section can be applied to reduce 
the problem of minimization of thefunctional / on D to the problem of 
maximizing the functional R(i, y, u) on V(i) for every i=1, 2, ..., N—land 
the functionals @o(y, «) and M(y, u). The necessary relation between these 
subproblems is established by an appropriate choice of the functional g(t, y). 

The conditions of the theorem leave us considerable freedom in choosing 
the functional (i, y). By imposing additional constraints on g, we can thus 
develop various methods of solution within the framework of the proposed 
formalism, including discrete analogs of the continuous methods considered 
in previous chapters. We will now describe some of these analogs. 


§ 6.3. BELLMAN'S METHOD 


Suppose that for all / the set V(i) coincides with the section V*(i) of the 
class D for a given /. Let V,(i) be the projection of V;on Y, i.e., the set 
of elements y € Y each of which can be paired with at least one element u 
such that (y, u) C V(i); Vu(i, y) is the section of V(i) for a given y € V,(i). 
On V,(i) we construct the functionals 


Pi, y= sup Ri, y, a), i=1, 2,..., N—-1; 


uev,,(i,y) 
Fo(y)= inf o(y, x); (6.12) 
ueVv, (N.Y) 
Fi(y= inf O(y, 2), (=0, 1,...,N 
veV (Ny) 


and choose (i, y) so that 
1) the functional P(i, y) exists and is independent of y, 


P(i, y) =c(i), =1, 2,..., N—1, (6.13) 


where c(i) is an arbitrary function; 
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2) the functional F,(y) exists and is independent of y, 
Fi(y) =e (6.14) 


where ¢c,is an arbitrary number. 

In practice, the selection of this ¢ amounts to solving a Cauchy problem 
for the functional equation (6.13) with the initial condition (6.14) in the 
direction from Nto 0. Indeed, from (6.14) we find 


e(N, y) =— inf fo(N, Y, u) +e, 
uev, (Ny) 
and then from (6.13) for i=N—1 


¢(N—1, y= sup [e[N, F(N—-1, y, w)J— fO(N—1, y, wh, (6.15) 
u€V,,(N=1, 9) 


etc. If the functional /°(i, y, u) is bounded on V(i) for every i CA, it is 
readily seen that the solution (i, y) of this problem always exists. 
Let {as(i, y)} be the sequence of the elements u€ Vu(t, y) on which 


Rii, y, u(y, D\> Pi, y) for i=1, 2,..., N-1 (6.16) 
and 


Poly, u,(0, yl = Fy (y); 
®,[y, 4,(N, y))—> Fi(y), (6.17) 


Samo 
and let {is(t) } be the sequence of the solutions of the system 
yE+D=SI4 yO, asi, yA), 
i=0,1,..., M—1; (6.18) 
Fo[ys(0)] > mo. 


The sequence {fs(i), as(i) =ws[t, ys(i)]} belongs to D and satisfies the 
sufficient conditions of the theorem, i.e., it is a minimizing sequence. 
This method of selecting g(t, y) thus leads to a complete solution of the 
problem. 

If we take c(i)=0, and interpret @(i, y) as minus the "gain function", 
setting 


¥(0)=y0o; ¥(N) =yn, 


the functional equation (6.13) coincides with the equation of Bellman's 
optimum principle /1/. 


§6.4. THE LAGRANGE—PONTRYAGIN METHOD 


Let Y and U be finite-dimensional Euclidean spaces with the elements 
y=(y',...,y") and u=(u',..., uu’), respectively; V(i)=Vy,(t)xXVu(i); the sets 
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V,(0) and V,(N) are the fixed points yp CY and yw CY, and V,(i) with 
i=1,2,...,N—1 coincide with Y; V,(é),i=0,1,...,M, coincide with U; the 
vector functions f(i, y, u) and the functions /°(é, y, u) are continuous and 
differentiable on V(i), i=0, 1, ..., M 

In this method, the function g(i, y) is sought simultaneously with the 
extremal (y(i), u(i)) CD. Assuming (i y) to be continuous and differen- 
tiable with respect to y for every i and introducing the vector function (i), 
defined as the gradient of @ at the points of the extremal, 


v= Oe (é, yVOYly -5 (i)> (6.19) 
we write the necessary conditions for a maximum of R: 


aR , ) ; ‘ — nN ye 
— =-— — =U: 
cree eae y(i)+ > Ai, o@4+1), 9, a) =9; (6.20) 


= Ri, y, 1) =H (i, e@+1, 7H, @DI=0, (6.21) 


y= y(t) u=u(t) 
where 


Ali, y, =of i, y, —SOC y,, 4). (6.22) 


These equations are a discrete analog of the Euler —- Lagrange equations 
of variational calculus in Pontryagin's form. Together with the boundary 
condition 


y (0)= yo; y (N)= yw 
£ Pol yor HO] = ~9(1)  F(0, Yor (0) + 
+2. F910, yo, @(0)] =O; 


2 lyn HIN) = 2 PIN, gyn, N)]=0, 
us Ou 


(6.23) 


they define the extremal (y(i), w(i)) € D and (i). To bring the solution to 
completion, i.e., to prove that the pair y(i), u(i)is indeed the sought absolute 
minimizing solution, we must show that there exists a function @(Z, y) 
satisfying (6.8), (6.9), and (6.19), 

Remark. Equations (6.21) are the necessary conditions for the 
maximum of R(i, y, u) and also for the minimum of / on D/3/. Further 
note that in the discrete case, as distinct from the continuous case, the 
maximum of the Hamiltonian H with respect to u (Pontryagin's maximum 
principle) does not provide a necessary condition for the maximum of 
R(i, y, 4), nor is it a necessary condition of optimum /3/. 


§ 6.5. PROOF OF NECESSITY OF THE CONDITIONS 
OF THE OPTIMUM PRINCIPLE 


We will now prove the second part of Theorem 6.1, the necessity. 
It follows from the lemma that if there exists a feasible algorithm for 
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for the construction of the functional g(i, y) satisfying the conditions of the 
lemma, then conditions 1 through 3 of the theorem are indeed the necessary 
conditions for an optimum. Such an algorithm, in particular, is defined 
by Bellman's method if the sets V,(i), i=1, 2,...,N, coincide with Y and the 
functional f°(i, y, wu) is bounded on V(i) for every fixed iC A. 

Consider the auxiliary problem of minimizing the functional 


N 
T=PPe yO, 2) 
t=0 
on the set D of the pairs y(i), u(i). Here f°[i, y, uJ= foi, y, w) for ye Vy (i) 
and fo=K for yeV,();K=q+ sup fi, y, 4); 7 is any number satisfying 


the inequality (y.H)EV (1) 16A 
N 
sup /—inf /= su 0, y, wz) — inf °C, y, #)). 
es BP E Sia yu) aaa y yl 


The set D differs from D only in the structure of the set V(d) of the ad- 
missible pairs (y, u) CYXU. Indeed, V,(i) coincides with Y for all i CA, 
and V,,(i, y) coincides with V.u(i, y) for yC V,(i) and with U for y € Y/V,(i). 
Clearly DC D and T=! for (y(i), u(i))eD. On any element from D which 
does not belong to D, the functional J assumes a value which is greater 
than its value on any element of D. Indeed, suppose that the condition 
(y(i), u(i)) © V(t) breaks down for 1!<m<N+1 values of 1. Then 


7 —sup / > mq+inf/—sup/ >(m—1)q>0. 
D Ee E 


This signifies that starting with some S=s all the elements of the mini- 
mizing sequence of the functional 7 lie in D, i.e., {Js(i), Hs(t)} ¢ D for S>s. 
Since T=/ on D, this sequence is also a minimizing sequence for the 
functional /. 

Let the functional (i, y) satisfy the conditions of the theorem for this 
problem. The functionals R(i, y, u), Mo, and M, corresponding to this (i, y) 
coincide on V(i) with the analogous functionals of the original problem, and 
since V(i) C V(i), they go on the sequence {fs(i), Zs(i)} C D to their largest 
(smallest) value on V(i), i.e., the functional p(i, y) satisfies conditions 1 
through 3 of Theorem 6.1 for the problem of minimizing / on D also. Since 
the problem of minimizing Ton D belongs to the type of problems for which 
the existence of the functional (i, y) is a necessary condition, it is alsoa 
necessary condition for the original problem. 
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Supplement 


THE SIMPLEST FUNCTIONAL ON THE SET OF DIS- 
CONTINUOUS FUNCTIONS AND FUNCTIONS WITH 
A BOUNDED DERIVATIVE 


The calculus of variations arose back in the 17th and the 18thcenturyasa 
branch of mathematical analysis concerned with the search of objects (functions, 
curves, surfaces, etc.) on which a given integral attained its minimum or 
maximum value. Because of the then prevailing concepts in mathematics 
and the first particular applications of the theory (the brachistochrone 
problem, the problem of light refraction, the problem of isoperimeters, 
and later the least action principle), it was naturally assumed that the 
minimum (or the maximum) would be attained on a smooth or, in the worst 
case, piecewise-smooth continuous curve, provided the lower bound of the 
functional existed. This conviction persisted until the time of Weierstrass, 
and it actually served as a basis for Lagrange's method of variations, which 
reduced the problem of the extremum of a functional to a boundary-value 
problem for differential Euler—Lagrange equations whose solution was 
followed by a verification of a whole gamut of necessary and sufficient 
conditions. After Lagrange, considerable contributions to the theory were 
made by Legendre, Jacobi, Weierstrass, and Hilbert, who developed 
Lagrange's method to a stage when it becamea highly refined, powerful, and 
often irreplaceable tool of analysis, which to this day constitutes the 
fundamental algorithm for the solution of variational problems. 

However, Weierstrass in his famous disputes with Riemann produced 
numerous examples showing that the minimum of a functional /(u), where 
u is an element of some set U, is not necessarily attained on the elements 
of that set. In particular, ifu@C,, the minimizing solution does not always 
belong to the class C; of continuous smooth functions y(t) and its derivative, 
and even the function itself, may have discontinuities. Moreover, as we | 
shall see below, the function y(¢) may fail to represent a curve. Hence it 
follows that by confining the analysis to C; we cannot obtain a conclusive 
solution of the problem regarding the absolute minima of functionals. 

This development was apparently responsible for Hilbert's unconventional 
approach to the problem of the absolute minimum of the functional 


B 
f=[ Fiy, y’, s)ds. (S. 1) 
A 


154 


He rejected the entire refined, sophisticated, and rigorous classical 
apparatus of the method of variations and defined the functional (S, 1) on the 
set of all rectifiable curves. He then proceeded to construct these curves, 
selecting those whose length — the functional /— approached a lower bound 
(a minimizing sequence). As the next stage, he proved the existence of a 
limit curve and established its extremal and functional properties. This 
approach constituted the prototype of a new algorithm, one of the so-called 
direct methods of variational calculus. We will concentrate on two trends 
among the various ideas prevailing in this field. 

1. The theoretical-functional trend. This trend is mainly 
concerned with the existence of an absolute extremal and the determination 
of its functional properties. A characteristic feature of this trend is the 
large-scale application of the theory of functions of a real variable. The 
problem of minimum is considered on the set of absolutely continuous 
functions, using Lebesgue integrals. This trend reached its peak in the 
1920—1930's in the work of Tonelli and his school in the West and Krylov, 
Bogolyubov, and Lavrent'ev in the USSR. Tonelli /17/ showed, for a 
number of important particular cases, that the absolute extremum is 
attained in the class of absolutely continuous functions. Lavrent'ev /10/ 
proved that the problem of the absolute minimum of the functional 


b 
J=l F(t y, y)at (S, 2) 


becomes meaningless if the class of admissible functions is extended to 
include all functions of bounded variation. He also derived the sufficient 
conditions for the lower limit of the functional (S. 2) in the class of absolutely 
continuous functions to coincide with its lower limit in the class of 
continuously differentiable functions. Later, this result was strengthened by 
Tonelli. Under certain additional conditions imposed on F(t, y, y’), the 
minimizing solution y(t) has almost everywhere a first and a second derivative, 
provided it belongs to the class of absolutely continuous functions. 
Bogolyubov /1/ generalized Tonelli's results. We thus see that the 
theoretical-functional school produced quite significant results. However, 
the fundamental problem of this school ~ namely the problem of existence 

of the absolute extremum and the functional properties of extremals — 
remains on the whole unsolved even for the simplest functional (S.2), not 

to mention the practical construction of extremals. It is not clear under 
what conditions the extremal belongs to the class of absolutely continuous 
functions and when it does not belong to this class and, if the latter applies, 
whether or not the class of the admissible objects can be extended so as 

to include the objects on which the minimum is attained. 

The material of the following sections indicates that these problems 
cannot be solved at all by traditional theoretical-functional methods. After 
all, the choice of the class of admissible objects is prescribed not by the 
inner logic of the variational calculus but by entirely extraneous factors 
stemming from the elements of the theory of functions of a real variable. 
Thus, the absolutely continuous functions are adopted as the class of ad- 
missible objects because the Lebesgue integral is defined on this class. 

But this is clearly of no relevance for the fundamental processes of the 
calculus of variations. 
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The above considerations reveal the limitations of the theoretical- 
functional methods, but in no way detract from their value, since ina 
number of important cases they provide the best and, possibly, the only 
tool for the solution of the problem of the existence and the properties of 
absolute extremals. 

2. The second trend in the theory of direct methods can be described as 
the applied trend. The determination of a minimizing sequence 
directly involves the sought function on which the extremum is attained and 
calls for approximating this function by a suitable sequence. If a minimizing 
sequence can be effectively chosen, we automatically prove the existence 
and often approach (although purely theoretically) with any desired accuracy 
to the solution of the problem. Since the Euler—Lagrange equations are 
analytically solvable in exceptional cases only, this direct approximation is 
generally more effective than the solution of the corresponding boundary 
problem. The second trend is thus mainly concerned with the various 
aspects of the particular choice of minimizing sequences and their 
convergence. 

Historically, the first method of construction of a minimizing sequence 
was developed by Euler (his finite-difference method). After two centuries 
in oblivion, it was resurrected and rigorously developed by Soviet mathema- 
ticians, mainly Lyusternik /11/, and also Petrovskii, Krylov, Bogolyubov. 
In the West, this method was taken up by Courant's school /8/. 

A powerful direct method for the solution of variational problems is 
Ritz's method, first advanced in /16/. Ritz considers an n-parametric 
family of functions y,(t, a), a= {a\, a2,..., dn}. On these functions, the 
functional /(y)is reduced to a function of a finite number of variables. The 
extremum /(¥yn) =/n is found by determining the coefficients from the 
equations 


Oa i 


In ordinary problems, the sequence of these extremal functions y,(t) was 
assumed to go in the limit to a function extremizing the functional /(y). This 
method found wide applications in a variety of problems. Poincaré referred 
to it as a method for the engineer, thus stressing its outstanding applied 
importance. 

Very extensive literature, both Soviet and Western, is currently available 
on Ritz's method. Of particular merit is the study of Krylov and Tamarkin 
/9/ who established a theoretical foundation for Ritz's method by proving the 
convergence of the minimizing sequence for a wide range of important 
applied problems. 

The complete solution of the variational problem by the direct method 
is obtained by a combination of the two schools of thought and includes the 
following stages: 

1. Construction of a minimizing sequence {yp (t)}. 

2. Proof of the convergence of {y,(¢)} to a function y(t) which belongs 
to the class of admissible curves. 

3. Proof of the convergence 


inf /(y(t))=1im (y, (0). 


y(t) n+ 
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Significant difficulties are encountered already in the first stage, since 
we have to solve a system of n finite equations in m unknowns for every fixed 
n—oo, Since these equations, as a rule, are nonlinear, new solutions may 
arise at every successive stage, greatly complicating the situation. In 
general, a direct solution of the variational problem is more involved than 
the solution by the method of variations, since in addition to obtaining the 
extremal we at the same time solve the more general problem of its 
approximation by some given system of functions. 

However, the solution of problems raised by Weierstrass's discovery 
does not necessarily follow Hilbert's path: the method of variations can be 
improved and generalized to a wider class of admissible curves. In this 
category we have the derivation of the well-known Erdmann—Weierstrass 
conditions at the corner points of the extremal and the theory of deeper dis- 
continuities developed by the Soviet mathematician Razmadze /13/. 
Razmadze proceeded from Weierstrass's well-known example: the integral 


+1 
IT=( Py'dt 
J 


with the boundary conditions y(—1)=—1, y(1)=1 has a zero lower bound. 
Indeed, the value of this integral on the family of curves 


t 
tan7 


joes 
sie, 


tan 
€ 


goes to zero for e—0, but the lower bound is not attained for any continuous 
curve. In other words, the problem has no classical solution. The limit 
of the family y(t, e) for e—-0 is the discontinuous function 


—1 #<0 


y(t) CSO 
and /(¥y)=0. Thus, if the class of admissible functions is extended to 
include functions with a discontinuity of the first kind, the integral can be 
minimized. Razmadze raised a general question: in what cases a problem 
which is unsolvable in the class of continuous functions has a solution on a 
wider set of curves with one discontinuity point. 

An integral of a discontinuous function is defined as a sum ofthe integrals 
over the continuous sections: 


to—0 & 
{Fy yiatt fF y y)dt, (S. 3) 


to+0 


& 
M(y=l FQ y yat= 


where ‘¢) is a point of discontinuity of the first kind of y(#). The functional 
of the discontinuous function y(/) defined in this way satisfies the condition 


/(y)=lim J (c,), (S. 4) 


> co 
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where {c,} CC, is an approximating sequence of continuous smooth functions, 
only if we have at the discontinuity point to 


‘ 1 
thy y, 0, 1)=lim pF ft, y, —)=0. : 
Slo y 1) wed (4 y 7) (S. 5) 


The necessary conditions to be satisfied by the extremal at the dis- 
continuity point are derived, 


Fy l—o= Fy li 9=0: 


(S. 6) 
Fl,,-o=F lt, 40- 


After that, the theory of sufficient conditions is developed. 

In general, when condition (S. 4) does not restrict the class of admissible 
curves, the discontinuity is "floating", i.e., not fixed. In particular cases, 
when condition (S, 5) is satisfied only in isolated points of the segment 
{a, 6], the discontinuity point is fixed (Weierstrass's example belongs to 
this category). 

Further development of the method of variations along the lines laid down 
by Razmadze was undertaken by Nikoladze /12/, Ermilin /2, 3, 4/, Kerimov 
/5,6,7/. Krylov generalized the fundamental lemma of variational calculus 
using Razmadze's results. 

Razmadze's main achievement, in our opinion, is the inclusion of curves 
containing a finite number of vertical segments among the admissible 
curves. This generalization in itself does not resolve the difficulty, but it 
constitutes an important step forward toward the application of a new 
algorithm, described below, which ensures a complete solution of the 
problem of the absolute minimum for a wide class of functionals. This 
algorithm includes Razmadze's general case as a particular case and 
essentially advances the solution of the problem for functionals not included 
in this class. Particular results of Razmadze's method and the entire 
theory of necessary and sufficient conditions lose much of their value in 
this case, since both the absolute and the strong local minima are attained 
on the (y, z2) minimals constructed by this algorithm and only on them. 
Razmadze's theory retains its original value only for the particular case of 
fixed discontinuity points. 

Another, although less significant, point to be remembered is that 
definition (S. 3) of a functional on a discontinuous function is by no means 
unobjectionable. It suffices to note that this definition is meaningless even 
for the simplest problem of a curve of minimum length, since it ignores 
the length of the vertical sections. 

A more general and more natural definition ofa functional on discontinuous 
functions will be given below. It will include, as a particular case, those 
problems for which definition (S. 3) is meaningful. 

A new method is proposed for the solution of variational problems. This 
method establishes the existence of a new class of minimizing solutions, 
which are no less typical then the classical Euler—Lagrange extremals, but 
are of fundamentally different nature. Unlike the Euler—Lagrange extremals, 
these new extremals are not solutions of any boundary value problem. Their 
finite equations are written directly in the form of the necessary conditions 
of extremum. The new minimizing solutions are not necessarily functions: 
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they may belong to an entirely new class of objects, called (y, z) curves. 

If z(t)= y’(t) everywhere, the (y, z) curve is an ordinary piecewise- 

smooth function. If, however, y’(t)z(t), the (y, Z) minimal is not 

a function, but if it is known, i.e., the pair of functions y(t), z(¢) are known, 
a minimizing sequence of piecewise-smooth curves approximating to the 

(y, Zz) minimal is also known. The new method is considered for the partic- 
ular case of the simplest functional 


r= Fee y, y')dt, (S. 7) 
y(a)=ay, y(6)=b,. 


This functional is traditionally used as a touchstone for all new theories 
in variational calculus (in many cases it is sufficient to reject a theory), 
and more than half the publications on the subject are concerned with it. 

If the function 


tlt, y P, N=PF (4, 4, —) 


for p40 exists and is continuous in p in the (f,y) plane, the proposed 
algorithm gives a complete solution of the problem of the absolute minimum 
of the functional (S.7). In this case, the minimum is attained on the above- 
mentioned minimals of the new type. An exceptionally simple necessary 
and sufficient condition of minimum is derived in the form of a minimum 

of some known function S(t, y, z) with respect to y and z for every fixed 

t€ [a, 6]. In other cases, the minimum may be attained both on the (y, z) 
minimals and on the classical Euler—Lagrange extremals. 

Next we consider the important applied problem of the minimum of the 
functional (S. 7) on the set of functions with a bounded derivative, i.e., 
functions satisfying inequality constraints. It is shown that in this case we 
are again dealing with minimals which are analogs of the (y, z) minimals 
of the simplest functional, but the role of the vertical directions is assumed 
by the limit directions 


y=h(t y) or y’=frlt, yg). 


Under certain conditions, the minimum in this problem is attained on 
regular Euler extremals, 

The new method is applied to solve some modern variational problems 
of applied mechanics. These solutions are of independent interest and, 
moreover, provide an excellent illustration of the application of the new 
method and the actual form of the intangible (y, z) minimals. 


Some introductory propositions 


Let a functional / satisfying the condition 


inf /(u) > —oo (S. 8) 
ueM 


be defined on some set M, 
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We are looking for the absolute minimum of /(u) over M, i.e., for an 
object % which satisfies the equality 


/ (u)= inf /(u). (S. 9) 
4uEM 
The elements of the set M do not necessarily include the element u on 
which the lower bound is attained. In this case, we shall try to embed M 
in a wider set M) Min which the minimum element i is contained, 
appropriately extending the definition of the functional / on the set M. 
The extension /;, (the definition of / on M) only has to meet the following 
requirements: 
1) on the elements of M it coincides with the original functional, i.e., 
ly (4) =Im(u), «CM; 
2) ifu@C Mthere exists a sequence {un} C M such that 
1% (4)=lim (u,). (S. 10) 


nw 


If a suitable set M is given (in the particular case u €M it coincides 
with M), the element 7 € M minimizing the functional / (u) on M has been 
determined, and the sequence {un} ( M satisfying the condition 


1 (u)=lim/ (a,) (S. 11) 
n—> 
has been constructed, we consider the problem of the absolute minimum 
of the functional on the set M solved. 

The element z ¢ M is called the absolute minimizing solution (or the 
absolute minimal) of the functional /onM. The sequence {uz,} isa 
minimizing sequence, and the set Wis an /-extension of the set M. 

Lemma. Let 2€M satisfy (S.9). Then 


/ (u) = inf /(u). (S. 12) 
u€M 


Conversely, if 7 CM satisfies (S.12), (S.9) holds true. Let (S. 9) be true, 
and suppose that (S. 12) is not satisfied. Then there exists an element 
vu €M such that /(v)</(%). We write 


I(a)—I(v) =p>0. 


In virtue of the definition of / on M, there exists a sequence {u,} CM 
such that |/(v)—/(un)|<P for sufficiently large n and therefore /(un)</(z). 
The last inequality contradicts (S.9), however. Let(S.12) be true. Since 
M (CM, we have /(z)</(u), u@ M. On the other hand, according to the 
definition of /on M, .there exists a sequence {u,} ( M such that /(u,) =/(z). 
Hence, in virtue of the definition of the lower bound, we obtain (S, 9). 

This lemma shows that the minimals of the functional / on the sets M 
and M coincide, so that instead of minimizing the functional on the set M 
we may minimize / on M, if this presents any advantages. 
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§S.1. THE SIMPLEST FUNCTIONAL 
1.1. Statement of the problem 


Consider the minimum of the functional 
b 
M(a)=[ Fy y)dt (S. 13) 
a 


on the set U of curves with the following properties: 

1) the coordinates ¢ and y of the points of the curve u € U may be re- 
presented as continuous functions of some parameter k ; 

2) the function y(t) is continuous along the curve u and is single-valued 
everywhere on [a, 6], except a finite set of points {y;} (i=1, 2, ..., &), 
where it may have discontinuities of the first kind; 

3) the derivative y’(t) of the function y(t) is continuous and bounded on 
the set [a, 5]/ {pi}; 

4) the function y(f) satisfies the condition 


y(a) =a); y(6) =b,, (S. 14) 


where a, and 6, are known; 

5) the curves u €U lie in the region B on the (#, y) plane where the 
function F(t, y, z)is continuous in all the three arguments together with the 
derivatives F;, Fy, F, for any z; 

6) the upper and the lower boundaries I;(4)and I2(t) of the region B, if 
they exist, have the properties 1—3 of the set U. 

Definition 1. The right-hand side of (S.13) is interpreted as a line 
integral along the curve u in the direction from the point A(a, a,) to the 
point B(b, 5\), i.e., 


8 . 
M(u={ F(t yt yak, (S. 15) 
where 
= 7). 
F(t, y, Py g= pF (t, y, a) (S. 16) 


k is the parameter taking the values k=oa for t=a, y=a, and k=$>a for 
t=6, y=b, which increases along u from A to B. 

In virtue of the above properties of the function F(z, y, z) and the set u, 
the latter contains curves on which the functional is finite. For the problem 
to be meaningful, we have to assume further that 


inf J(u) =m>— oo. (S.17) 
ugU 
Taking dk=dt for pi<t<piyi and dk=|dy| for t=p;, i=0, 1, ..., k&, where 


Po=a, pta=b, we obtain from Definition 1 
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n—1 n 


wM=> T+D Olen ved, ¥ Hd), (S. 18) 
i=0 t=0 
where 
H4570 
pea Fdt: (S,19) 
p,+0 
¥ — 
Ot, y, =| WE, &, sign(y—y)) ak; (S. 20) 
i 
y=y(t+0); y=y(t—0): (S. 21) 
Wit, y, sign(y—y))=lim f(t, y, p, 1) 
|p| 0, sign p=sign(y—y). (S. 22) 


Because of the properties of the function F(t, y, z) and the set U, the integrals 
Jhalways exist. Therefore /(u) exists on those and only those curves u €U 
for which the integrals (S. 20) exist on the vertical sections (if any). 
Let the curve z€ U be the minimal of /(u)in U, i.e., #(z) = int (4). By 
ue 


(S.17), /(%) exists and the existence of the integrals D(ui, yi, ¥:) on the 
vertical sections therefore may be regarded as the first necessary condition 
to be satisfied by the minimizing solution uw. 

This one condition enables us, in some highly important particular cases, 
to assess the qualitative character of the minimum and, in particular, the 
possible existence of discontinuities and their position, 

The general case in our treatment is such that the condition of existence 
of M(t, y, y) at the points of discontinuity does not impose any additional 
constraints on the class of admissible curves, i.e., @(t, y, ¥) exists for 


all t, y, y CB. 


1.2, Relation to the problem of the absolute minimum 
in the class of piecewise- smooth functions C, 


In the class of piecewise-smooth functions C;, Definition 1 coincides 
with the conventional classical Riemann integral /(u),u€ C). 

Let the object 7 € U be the absolute minimal of / on U, where U is some 
/-extension of U. The set Uis the /-extension of the class C,if for every 
curve u€ U there exists a sequence {c,} ( C, such that 


/,(4}=lim /(c,) (S. 23) 


Nw eg 


Here the subscript u indicates that the functional is taken in the sense of 
(S.15). Indeed, in virtue of the definition of the /-extension, for any u EU 
there exists a sequence {un} ¢ U satisfying (S.12) and, therefore, by (S. 23), 
also a sequence {c,} C C, such that 


1;(u)=l1im / (c;). 


N->co 
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By our lemma, the absolute minimal u of the functional / in u is therefore 
also the absolute minimal in the class C,, i.e., 


! (u)=int / (x), (S. 24) 
BEC, 

if for every u € U there exists a sequence {cn} ( C, satisfying (S. 23). 

We will show in what follows that this condition is always satisfied in the 
general case. In the so-called particular cases, it is not always satisfied. 

If the absolute minimal in U is also the absolute minimal in C,, we will 
refer to it as the absolute minimal of the functional / (without mentioning 
any classes). 


1.3. The general case 


The general case in our terminology is such that the condition of 
existence of M(t, y, y) on the vertical sections does not impose any additional 
restrictions on the class of curves, i.e., M(t, y, ¥) exists for all f, y, y. 
This implies that for every fixed iC [a, 6], the two functions of a single 
variable j[t, y, +0,1] should be summable on any segment [y, 9] € [Te(t), T(4)]. 

In what follows, we assume that both functions f (1, y, +0,1) moreover 
exists and are continuous with their derivatives f, and [, everywhere in B. 

We will see below that the character of the extremals is determined by 
the properties of the function f(t, y, p, 1) for p=0, specifically, by whether 
the function f(t, y, p, 1) for p=0 is continuous or discontinuous in p. Both 
these cases are of the greatest significance both from theoretical and applied 
considerations, and we will consider them separately in some detail. 

The system of definitions and theorems introduced below leads to a 
complete solution of the problem of absolute minimum of the functional 
(S, 13) in the first case and discloses a number of highly valuable new facts 
in the second. It admits of very extensive generalizations and provides a 
foundation for all further constructions (Figure S. 1). 

Consider the set of polygonal lines , {y,}] CU, described by the functions 
y.Q=y +g (t—t) for tp. t<tiy,, =0, 1,..., 2—1, where 


a=ty< ty... Shy St 


are the abscissas of the discontinuity points of y,(#), and yi, y; are 2n in- 

dependent parameters defining the polygonal line yn for any fixed partition. 
Definition 2, Consider a line u CU defined by the function y(t). 

We shall say that the sequence {yn} approximates the line u, or {yna}—u, 

if naw 


yi=y(ti+0), (S. 25) 
and for any e>0, 


trar—fl <8; [yp — yf +0) <e (S. 26) 
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for n>N(e). Here i=0,1,...,n—1!. In this sense, f{y,} is everywhere dense 
on U,. 

Definition 3. Consider a pair of functions y(t) € U and 2(t) which are 
defined, bounded, and continuous on the set [a, 6]/{ui}. We construct the 
sequence {y,} so that 


yi=y (ti +0) (S. 27) 
and for any e>0 
lta —tl<s [yi — z+ 0i<e (S. 28) 


FIGURE S.1 


for n>N(e), i=0, !,..., m—l. In this case we say that the sequence {yn} 
approximates the (y, z) line up, or {yn}—-Uuo. The set of these (y, Z) lines 
is designated U). The function y(¢)is the zero closeness function of the 
(y, 2) line up and 2(t)is the local slope of this line. 

If 


2(t)=y'(t) (S. 29) 


almost everywhere on [a, 6], we have uw CU. Thus U CU. 
Definition 4. The functional (S.13) of a (y, z) line is defined as the 
limit 
/ (4o)=lim J (y,). (S. 30) 
{Tp} 70 


Theorem 1. Ifthe function f(f, y, p, 1) exists and is continuous in p 
for p=0 and for any ¢, y € B, the functional /(u), uC Uo, exists and is 
expressed by the formula 


A—1 pb; 4 1-0 k _ 
F(u)=S { (Fen a+WE Wy Za + ¥ OO yor Wd) (S. 31) 
{mo #,+0 i=0 
or by 
6 
T (ua)=(R) J S(t, y, 2) a+ (6, by, © (&)— @(a, ay, ¢ (a), (S. 32) 
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where 
y 
St, y 2)=Fi 9 DW y) (2—C WO) — Fit, Da, (S. 33) 
e(t) 
where c(t) € C, is an arbitrary function; W(t, y) and O(é, y, c) are defined by 
(S. 20) and (S. 22); p; are the discontinuity points of y(t), i=0, 1, ..., &. 


The symbol (R) indicates that a Riemann integral is meant. Since 
f(t y, p, 1) is continuous for p=0, 


WL yI=W(t y —l)=W(t y) =f(t y, 0, 1). (S, 34) 
Using (S. 20) and (S. 34), we may write 
O(t y, ¥)=D(t, y, c)—O(4, ¥, ©), (S. 35) 


where c(¢)is an arbitrary function. In what follows we take c(t) EC. 
By (S. 18) and (S. 35), 


a—) 


1 (vq) = J Sidt + (6, by, ¢ (6))—¥(a, ay, ¢ (a), (S. 36) 
1=0 
where 
try 1-0 
S,At, = 3 Fé, Yns Yn) dt+ ® (iz, Yu c)— 
t/+0 = 
—OFai Yrs ©) 
t; are the discontinuity points of the function y,(#), i=0, 1, ..., a—l, or 


using Lagrange's theorem of finite differences 


Si=F fh, on (fi, yn (N)—O' (fi, oq (td), C(fD) = 
=F Ut, y(t), yn(én|—W (th, 9 (42) [ya (tt) — 
y(t?) 
—e'(N— f Wye at; (S.37) 
c(t*) 
tr=t, + ato, O0<O0<1. 
Using (S. 27) and (S. 28), we may write 


n—1 


1 (4)= SF Slt, yl), 2) 64 +0(2) + 
4.0(6, b,, ¢(6))—®(a, a, ¢(a)). (S. 38) 


Taking the limit y,—-up in (S. 30), some terms in (S. 38) go to infinity, 
and |éi41—ti] 0. 

In virtue of the properties of F(t, y, z), the function S[t, y(t), 2(t)]is 
continuous in all its three arguments in region B of the (t, y) plane for any 
finite z. Since y(t) and z(t) are continuous almost everywhere and bounded 
on [a, 6], Slt, y(t), z(t)] is also continuous almost everywhere, so that by the 
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Lebesgue theorem, the function S(#, y(t), 2(t)) is Riemann-integrable. This 
indicates that for {yn}—-u the sum (S. 38) has a limit, i.e., it is independent 
of the particular choice of the sequence {y,}—-uo (independent of the partition 
{t; } and the position of the points ¢7). 

We proved the existence of the functional /(uo) on any (y, 2) line up € Uo 
and the validity of (S. 31). Similarly, starting with (S.18), we prove the 
validity of (S. 31) (this formula is proved in a more general form in 
Theorem 2). Q.E.D. 

Theorem 2. If the function f(t, y, p, 1) has a discontinuity of the first 
kind in p for p=0 for any (¢, y)€B, the functional J(u), u € Uo, exists and 
is expressed by the equality 


RV Bj 4 170 
1(w)=S f (FA 24+ y, sign (y'—2))]X 
1=0 pw, +0 
X(y—2)dt +S f Ws, sign(y—y)at (S. 39) 


i=0 y, 


or by the equality 
&b 
I (u)=(R) { S(t, y, 2, sign(y'—z))dt+ 
a 


+3 le (ti, Yy, ¢, sign (y’—2))— ®(t;, 4, ¢, sign(y’ —z))]+ 


jel 
+ [b, 5, c, sign(b,—y(6))]—®[a, ay c, sign(y(a)—a,)], (S. 40) 
t; are the points where the difference y’—z reverses its sign, j=1, 2, ..., r. 


By (8.18), (S.27), and (S. 28), remembering that y(t), z(t) are continuous 
and bounded, we have 
a—l f¢. 


+179 
d= 


ico ! 


Fb de 2)a+D (Wit, y, sign (y—g)ldy= 
0 i=0 y 
n—! 
= > {F(é, y(t, 2(4))+ 
i=m0 
+W Iti, y(é), sign(y’ (t)—z (f)+0(e))} + 0(¢)) (441-4, + 
a i) 


+ J Mb & sign(y,—y)] a +000). (3. 41) 


i=0 y(H;) 


In virtue of the properties of the functions F(t, y, z), W(t, 4, +1), y(4), 
and 2(t), the function 


F(t, y(t), 2(t)) + Wt, y(t), sign (y’(t)—2(t))] (y’(t)—2(#)) 


is bounded and continuous almost everywhere in the intervals (mi, pit). 
Hence, by the Lebesgue theorem, it is Riemann-integrable, i.e., the 
limit (S. 30), where /({y,) is expressed in the form (S, 41), exists and is 
independent of the choice of the sequence yn. 

The validity of (S. 40) is proved along the same lines as Theorem 1. 
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Corollary 1. The functional /(uo), uo € Uo, depends ontwo independent 
functions y(t) and z(t) whose derivatives either do not enter the integrand, 
as in (S.31), or only enter as sign (y’—z), as in (S. 40). 

Corollary 2. It follows directly from expressions (S. 31) and (S. 39) 
for /(uo) that the functional is continuous on the set Up of (y, z) lines in the 
sense that 


\7 (a) — 1 (tol <e, 
if (S. 42) 


ly—yO<n(e); lz(9—z(d| <n) 


everywhere on [a, 6], except the n-neighborhoods of the points ; where 
y(t) is discontinuous. 

Corollary 3. It follows from (S, 32) and (S. 40) that Definitions 1 and 
4 coincide on the set of y(t). 

Corollary 4. It follows directly from Definition 4 and Corollary 3 
that the set Uy is an /-extension of the set U. 

Remark. Expression (S. 40) for /(u), uC Uo, is not defined on ordinary 
smooth curves, i.e., for y’(t) =z(t), The continuity of the functional on Up 
shows that its definition can be extended for these curves if we take for 
y’—z=0 


sign(y’—z)=1, or sign(y’—z)=—1. 


In this case, expression (S. 40), like (S. 39), is defined over the entire 
set Up. 

We will now show that U, and also Up, are J-extensions of the class C, 
of piecewise-smooth functions, i.e., for any line u CU there is a sequence 
{en} © C, such that 


1,(a= lim 1 (¢,). (S. 23) 


Consider some line uC U. We replace all the vertical segments of the 
line u« to segments making an angle of - to the vertical which pass through 
n 


the midpoints of the vertical segments. Let n>0 if the deflection from the 
vertical is in the clockwise direction, and n<0 if the deflection is counter- 
clockwise. The deflection should be such that the resulting line ¢, corres- 
ponds to a single-valued function yn(t), i.e., 


sign A = sign [y(u; +0) — y(v,—9)]. (S. 42a) 
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Clearly {c,)CC,; then 


- #+0(=) pati? (z) 

+2 aa Ft, y, n)at= oil) Fat+ ee 
a a(rees)) 

+y) f F(t, y, 2)— dy; 


 ora(2) 


signa=signdy. 


Taking the limit n—-oo in (S. 43), we obtain on the right expression (S, 18) 
for /(u) in the sense of Definition 1, Hence, for any line uC U, there 
exists a sequence {c,} ( C, which satisfies (S.23). Therefore, the absolute 
minimal on U is at the same time the absolute minima! of (S. 13) on C,. 

In our definition of Up, we imposed the following conditions on its 
elements, (y, z)lines: y(t) CU and z(t) is bounded everywhere on [a, 6] and 
continuous on [a, 6]/{ui}, where {p;} is a finite set of points. 

Definition 5. We say that the sequence of (y, z) lines {un} € Uy goes 
to a (y, z) object u C Uoif for fixed ¢, almost everywhere on {a, 6], 


y(t)=lim y, (4); 


(S. 44) 
z(t)=lim z, (4). 


fi-r oo 


The last equalities are to be interpreted in the following sense: if y(t) 
is finite at a point ¢, then for any given e>0 there exists N such that 


jy()—yOl<e for 2>N. (S. 45) 


If y(t) =+00, then for any M>Oor, respectively, M<0Q, there exists 
N such that for n>N 


yn(t) >M>0, (S. 46) 
or, respectively, 
Yn{t) <M<0. (S. 47) 
The convergence for z,(t) is similarly interpreted. 


The set Uo of (y, z) objects is called the closure of Uj). Evidently, U is 
an /-extension of Uo. 


168 


Definition 6, The functional(S.13) on an object 2€ J, is defined by 
the equality 


{(a)= lim [(4,). (S. 48) 
(Jr {4n} Uo 


Here /(un) is expressed by any one of the equalities (S. 31), (S.32), (S. 39), 
(S. 40). 

Let us now consider separately the two cases when f(t, y, p, |) is 
continuous at p=0 and when it has a discontinuity at p=0. 

1. The function f(t, y, p, 1) is continuous in p for p=0 and for any (¢, y) CB. 

The problem of the absolute minimum of the functional (S, 13) in this case 
is completely solved by the following theorem. 

Theorem 3. The absolute minimum of the functional (S. 13) is 
attained on a (y, z) object 2€Up which satisfies the condition 


S(t, g,2)= "int S(t, y, 2) (S. 49) 


T(t) <y(t) <0, (t); oe <zseoo 


for fixed f almost everywhere on [a, 6] and only on this object. 
Proof. We will first prove the theorem for the absolute maximum on 
Uy. By (S.17) and the lemma we have 


inf / (4)=m > — oo. (S. 50) 


ué 


According to the definition of a lower bound, there exists a sequence 
{un} C Uo such that I(un)—m, I(un)<m. Expression (S, 31) for /(u), uw G Up, 
awe 


and (S. 50) directly show that: 
1. The lower bound of S{é, y, z) exists everywhere on [a, 5], 


j= inf S(t, y, 2) 


T(t) <y<T,(t), —e<z<too 


and since S(#, y, 2) is continuous in y and z, there exist 7(f) and Z(t) on 
which S=((t). 
2. On the sequence {u,} for n— oo, 


S(t, Yn Zn) >l() and y,(t)> y(t), Zz, () > z() 


in the sence of Definition 5. Since (yn(t), zn(t)) € Uo, the (y, z) object % de- 
scribed by the pair (y(t), Z(t)) belongs to Up. 

3. From the definition of the lower bound and from item 2 above we 
have 


b 
m= lim /(4,= lim S(t, Ya Z,) at-+ const. (S. 51) 
a a 


now, {u,)6Uo Unlt) H(t): z(t)-r2(0) 4 


By Definition 6, the last expression is the functional /(u), 7 CU. 
We proved that the absolute minimum of the functional (S. 13) on U is 
attained on the (y, 2) object 2 € Uy satisfying (S. 49) and, by item 2, only 
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on this object. The theorem is thus valid for minima on Uj. Since Uo, 

and hence Up, are /-extensions of U and C,, our lemma shows that the 

theorem remains true for the absolute minimum also. Q. E.D. 
Corollary 1. It follows from (S. 49) and the continuity of S(t, y, z) and 

its partial derivatives, that if the zero closeness function y(t) of the 

minimal @ lies inside the region B, and the local slope z(t)is finite, then 

almost everywhere on [a, 6] 


S,= Fy—Wy-z—-W,=0; 
y y y t (S, 52) 
S,=F,—W=0. 
Corollary 2. If the classical Euler—Lagrange extremal 7(t) which 
is the solution of the equation 


4! (y(t)) =0 


in the class C, of.continuous piecewise-smooth functions does not satisfy 
(S. 49) (Z=7’ in this case), the functional / either has a weak local minimum 
in C, on this extremal or has no minimum altogether. No strong local 
minimum in C;, and certainly no absolute minimum, can be attained on this 
extremal. 

Discussion. Condition (S. 49) defines a (y, z) object % on which the 
functional (S.13) attains its absolute minimum, i.e., it defines the zero 
closeness function y(t) of the functional and the local slope Z(t). If we find 
that 7(t) C U and y’(t) =z(t) everyWhere on [a, 6], except a finite number of 
points, then # CU. Otherwise, the minimal 7€U , but the solution 
y(t), Z(t) defines a minimizing sequence {yn} @U. If%€ Uo, the minimizing 
sequence {yn} ¢ U is constructed as indicated in Definition 2, i.e., we 
construct the polygonal lines y, so that 


w= y (ti); yi =2(t,), i=0, 1, 2p shy n—1. 


Irrespective of the partition of the segment [a, 5], 


I(y,)—> 1 (a). 


fiw 


If uC Up, we first select a sequence of pairs {Ym(t), Zm(é)} C Uo which 
goes to uw in the sense of Definition 5, and on each element of the sequence 
we construct a sequence of polygonal lines {ynm}— Um EUs in the sence of 
Definition 3. Any sequence {y*} selected so that m—oo, n—oo for k—oo is 
a minimizing sequence. 

Thus, if we found the extremal pair j(t), z(/) we have all the elements 
of the minimizing sequence. 

If the minimal 7€ U, the minimizing sequence is of special importance, 
since the minimal itself cannot be constructed. 

Let us investigate the minimals in more detail, assuming additionally 
that F(t, y, z) and W(t, y) are twice differentiable in all their three arguments 
in region B of the (4, y) plane for any z. 
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1. Let(S.52) have a unique solution y°(#), 2°(t) anda positive determinant 
Dit, y®, 2°) = Syy-Szz—Siz> 0. (S. 53) 


Since the right-hand sides of (S, 52) are continuous and have continuous 
partial derivatives, the existence theorem of implicit functions indicates that 
y®(t), 2°(t) are continuous and differentiable on [a, 6]. If T2(t)<y(t)<Ti(¢) 
everywhere on [a, 5], y°(t) coincides with the zero closeness function y(t) of 
the minimal and z°(¢) coincides with its local slope z(f) everywhere on [a, 8]. 
Atthe points {=a and ‘=b, the values of y and y® ingeneral do not coincide: 


9 (a) =a, yo(a). 


Thus, if D(t, y®, 2°)>0, the minimal u% not only belongs to Uo, but its 
zero closeness function 7(¢) and the local slope z(t) are continuous and 
differentiable everywhere on [a, 6], except the points f=a and t=b, where 
g(t), in general, may have discontinuities of the first kind. If a posteriori 
we find that 7’(¢)=z(t) almost everywhere on [a, 5], then @ CU. 

2. Now suppose that system (S, 52) has a finite number of solutions for 
which D=<0 almost everywhere on [a, bj. By the existence theorem of 
implicit functions, these solutions are piecewise-differentiable. The 
minimal y(t), z(t) consists of the sections of these solutions with D>0 and 
of sections of the boundary. The selection of these sections is determined 
by the sufficient and the necessary condition (S. 49). It follows from the 
same condition that, besides the end points, the functions y(t) and z(t) may 
have discontinuities of the first kind at points ‘=p:i, where 


Sei YM (HD), Z(H) =Si 9 (Hd), 2 (2). (S. 54) 


Here y()(t), 2(¢) and y@(#), z(t) are different solutions of (S. 52) or 
pieces of the boundary. At those points where (S, 54) is not satisfied but 
D=0, y’(t) or z’(t) may become infinite or not exist at all. Moreover, at 
some points of the segment [a, 6] or on some continuum, we may find that 
condition (S.49) is not satisfied for any of the solutions of (S.52), and inf S(t, y, z) 
is attained for z=+ooor, if Bis unbounded, for y= -+co(S remaining 
bounded). It follows that in this case the minimal Z does not necessarily 
belong either to Uy or to Uo. 

By Theorem 3, it always belongs to Uo. As in previous cases, the 
minimal z(t), y(f)is fully defined by (S. 49). 

Together with region B, we introduce a right cylinder Q in the (¢, y, z) 
space having B as its base in the (f, y) plane. According to the statement 
of our problem, the function F(t, y, z) is continuous in Q together with its 
derivatives F;, Fy, F,. 

3. System (S. 52) has no solutions in the interior of the cylinder Q in 
the (4, y, z) space. The minimum is attained on the boundary of Q. 

Suppose the minimum is attained on the upper boundary I, (¢). We thus 
have 


y=Ti(t). (S. 55) 
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The local slope z of the minimal is obtained from the condition 
S(t, T,@, ZB=inf S (4, P,(9, z). (S. 56) 
0 €Z<g 00 


If z(t) is finite, it satisfies the necessary condition 


S.=F,(t, T,(¢), 2) —W(t, T(£)) =0. (S. 57) 


If T,(¢) € U and z(t) is bounded, we have 7 CU). We would like to make 
one observation concerning this case. 

If I’ (t)—z(t)<0, the approximating sequence {yn} constructed along the 
previous,lines, i.e., the curve 


yi=y (ti +0; yi=z(t,+0), i=0, 1, 2,...4, 


does not belong to the set of admissible lines, since it extends beyond the 
boundary of B. In this case we should take 


y=9(t,+0); yr=z(t, +0), (S. 58) 


where 
Y1= Yi-r t+ yi-1* AL = y(t; — 0). 


The sequence {yn} constructed in this way clearly belongs to B and, in 
virtue of the continuity of the functional /(u), u C Uo, in the sense of (S, 42), 
it is also a minimizing sequence. 

4. D(t, y,z)=0 everywhere in the cylinder Q. System (S. 52) is 
equivalent to the single equation 


f(t y, 2) =0. (S. 59) 


This equation contains two independent unknown functions, y and z. 
Indeed, the class Ug in general, contains infinitely many (y, z) lines satis- 
fying the necessary conditions (8.52), These lines may include extremals 
which belong to U. The latter should satisfy, almost everywhere on [a, 5], 
the differential equation 


f(t y, y’) =0. (S. 60) 


Such an extremal may have discontinuities of the first kind on any finite 
set of arbitrarily selected points ¢;( [a, 6] containing at least one point. On 
the smooth sections it satisfies the equation 


y(t) =y(t, C;), 
where y°(t, C;)is a general solution of (S. 60), and C; is the integral constant 


for the i-thsection, if there exist C;C,,, such that condition (S. 54) is 
satisfied at the points /;, which takes in the present case the form 


SA, 9° (ti, Cr), 9 (te Cr) =Sbo W(t, Cian, y (tir Cray). (S. 61) 
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Thus in general the extremal u belonging to the set U is not unique. The 
only exception is the case when 


f.=0. (Ss. 62) 


The extremal is then unique and is defined by the equation 


F(t y) =0. 


In virtue of the properties of F(t, y, z) and W(t, y), the function #(t)is 
continuous and smooth everywhere except the end points, provided f,(t, y) #0 
everywhere on (@, 5), 

A more restricted problem can be formulated: find the extremal with 
the least number of discontinuities on U. It is readily seen that this 
extremal in our case consists of two smooth branches which are solutions 
of equation (S. 60) passing through the points (a, a.) and (0, b,)and a straight 
vertical segment ¢=to joining the ends of these branches, where f) is some 
point of the segment [a, 6]. 

This result coincides with Razmadze's result obtained in /13/. 
Razmadze's condition for the case of a "floating" discontinuity point also 
can be obtained without difficulty: 


F (to, Yoo Yo) == F (to, You yo); (Ss 63) 


Fy: (tos Yo. yo) = F y' (toy Yo. yo)= 0, 


where yo=y(to+0), Yo=y(to—O0). To this end, it suffices to take W(t, y) =0. 
in (S.52) and (S.54). Then from the second equation in (S. 52) we obtain 
the second condition in (S. 6), and from (S. 54) we obtain the first equation 
in (S.6). Note that Razmadze's conditions both for a fixed and a "floating" 
discontinuity are valid only when W=0. 

The sufficient conditions of extremum, as before, are presented by 
(S. 49). 

5. As we have seen above, if y(t) U and z(t)=¥’(t) almost everywhere 
on [a, 6], the minimal w CU. It is readily seen that in this case y(t) almost 
everywhere satisfies Euler's equation 


Fa da v 
Fy(t, yy ar Fy (ty, y')=0. (S. 64) 


To show this, it suffices to differentiate the second equation in (S. 52) 
and subtract it from the first. 

Conclusions. I. Letus summarize the results of this section. Ifthe 
function f(t, y, p, 1) for p=0, (t, y)C B, exists and is continuous in p, the 
absolute extremum of the functional (S.13) is attained on extremals of a 
special kind (we call them type a extremals), which are fundamentally 
different from the classical Euler extremals (which we call type b 
extremals). In distinction from type b extremals, every linear element 
(y, 2) of type a extremals is independent of the other elements and minimizes 
the function S(t, y, z) for every fixed ¢C[a, 6]. To obtain finite equations 
of these extremals, we do not have to solve any boundary-value problems: 
they are obtained directly in the form of the necessary condition (S. 52), 
An extremal on U may belong either to this set or to a larger set Uj). In 
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the latter case, although w EU, the solution y(t), z(t) fully describes a 
minimizing sequence {yn} CU (see Definition 3). 

Il. The function f(t y, p, 1) for p=0 and any ¢, y € B has a discontinuity 
of the first kind in p. 

In this case, no analog of Theorem 3 giving the necessary and sufficient 
conditions of minimum and providing a complete solution of the problem 
exists. We will only derive the necessary conditions of minimum in Up. 

The minimum is attained both on the (y, z) minimals of the type 
considered before and on the classical Euler—Lagrange extremals. Let 
u€U,. By Theorem 2 we have 


0 


Re (Pi47 
Iw=> [Fi y, 2+W (4, y, sign (y’—z)] X 
imo | Bj+O 
y; -_ 
x (y'—2)dt-+ |W (wa, sign(y—y,)) ab} . (S. 39) 


y 

Here ti are discontinuity points of y(t). By (S.39), the functional /(u) 
may be treated as depending on a pair of independent functions y(t), 2(¢). 
Using expression (S. 39), we can write and investigate the expression for 
the first variation of /(u) with respect to y(t) and z(t) in the usual classical 
sense. The class of admissible functions is further restricted by the 
requirement of smooth 2(/)on the intervals (ji, wit). The discontinuity 
points p; are assumed fixed. 

The necessary condition of minimum is 

6/(u) >0. (S. 65) 


The expression of the first variation is 


Rk ¥r19—9 
m= f [Faby t+ F8z+-Wy(y!—2)8y + 
te0 P;4 0 


k 
+W8(y'—2)dtt+ F (Wes, yi sign(ys— yl by; — 
i=0 


—WiIp;, yi, sign(y;—y)+5y;, —by,)] 5y;), (S. 66) 
where 
by=y" (—y(); Bz=z"(—2(4); BY =U" (N—y' (A, 


y*({), 2*(t) are the corresponding functions of the line u* € Up which is 
sufficiently close to first order to the line u. 

Since the functional (S. 39) is linear in y’(t), its variation with respect 
to y’ coincides with the increment with respect to y’(t). Therefore y*’is 
not necessarily close to y’(t) and we may take 6y(t)€ U. After simple 
manipulations, the integral term in the expression of the first variation 
is easily written in the form 

RBH? 
M(u= Df (lFy— Waly’ —2)] by + Feby’ + (W—F,)8(y'—2)} de. 


im0 p, +0 


This expression gives 6/(u),as a functional of four independent functions 
y(t), z(t), dy(t) and 6(y’—z(t)). Since é6y(t) C U, we regard the functions 
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y(t),2(t), and 6(y’—z(t)) as given and use (S. 31) and (S. 39) to obtain 


BR bp41—0 by 
w= fet Wily’ —2ley— | FL yb, 2(0 ae) dt + 
t=Q By+0 c~0 
Rk j41-0 
+> § (W(t, y, sign(y’ —z-+8y' —8z)— F,]3(y’—z)X 
ing 2, +0 
Rey ay by k = 
xat+> J Fi lrada— f Faonat| + > [Ws y, sign (yy — 41) 34, — 
i=Q c=0 c=0 f=0 
k,—k = = 
—Wes, yr sien(y—y)) by] + DYE, ys sign (8y;—3y;)) — F,\(dy) — By), 
J=-0 


where &, is the number of discontinuity points of éy(t). 
Finally we obtain 


& Bi4 7-0 
o(w=Y) J [Fe ap Ft Wy’ —2)] buat + 
int BjtO 
k Hj4470 
+S) § [W (¢, y, sign (y’—2-+-8y' —bz))— 
i=Q By +9 


k 
—F,J8(y'—z)dt+ > (IW, yi. sign(y:— y)))— Fz )byi—- 
1=0 
—|W (w, W sign(y, —y))\—F, (4, Dy, 9) by) + 
ki—k as = 
+ SIM, gn sign(y;—8y)) — Fat yn 2s) ys — by). (S. 67) 
7=0 
Here 6yand 6(y’—z) are independent variations. (S. 67) leads to the 
following conclusions. 
1. The functional may have a minimum only if the right and the left 
limits of f(t, y, 0,1) along the extremal satisfy the condition 


W(t, y, signe) e > 0, (S. 68) 


This is readily established if we take in (S. 67) 


by (4) =0 
for 
[é—f| DVN, HE [a, BJ 
and 
by=by (4) #0 
for 


|t—~fo|<n, |n|>0. 
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For a maximum, this inequality should be reversed. This condition 
can be regarded as the first necessary condition to be satisfied by the 
functional if it is to have an extremum on the class U of admissible lines. 

2. The extremals of the functional (S.13) may be of the following types: 

a) The functions y(f) and z(t) satisfy the equations 


Sy = Fy(t,y,z)—Wy (ty, sign(y’ —2))2— W,(t,y) = 0; 
S,= F,(t,y,2)—W (ty, sign(y’ —z))=0. 


(S. 69) 


It is readily seen that these equations lead to zero coefficients before 
éy and 6(y’—z). To verify this, it suffices to differentiate the second equation 
in (S. 69) and subtract it from the first. Equations (S. 69) may be written 
in a more detailed form 


Sy = Fy—Wy (ty. 1) 2— Wi (Ay, 1) = 0; (Ss. 70) 


S,=F,—W(ty,1)=0, y'—z>0; 


Sy = Fy—Wy (ty, —1)—W (ty, — 1) =0; (S. 71) 
S,=F,—W (ty,—1)=0, y’-—z<0. 


Along these extremals, the first variation is zero, i.e., the extremals 
are Stationary. Moreover, as we see from (S. 69), they satisfy the 
necessary conditions of minimum of the function S(¢, y, z), but now the 
function additionally depends on the sign of the difference y’—z. These 
extremals are analogous to the (y, z) minimals of the previous section. Any 
infinitesimally small section of these extremals retains the property of 
maximum or minimum and is independent of all other elements, but only 
within the limits consistent with the inequalities in (S.70) and (S. 71). 

All that we have said in the previous section remains valid for these 
extremals with one reservation: condition (S. 49) is not sufficient in this 
case for an absolute minimum. 

b) The functions y(¢) and z(/) are continuous on (a, 6) and satisfy the 
conditions 


z()=y'(0; Fy~— F.=0; 


[W (é,y, signe) — Fy (yy )]-¢ > 0, e #0; 
(S. 72) 


(W (2,y (2), sign (y(a@)-+ by (a@)— @,))— Fy: (a,a), y’ (a))] 8y (a) > 0; 
[W (b,y (6), sign (b, — y (6) —8y (6) — F,-(6, 4 (d), 9 (4))] 8y (a) > O. 


The first two equations in (S. 72) define y(t) and z(t) and show that a type b 
extremal on U(U)) is a continuous differentiable function which satisfies 
Euler's equation. Every infinitesimal section of these extremals is 
dependent on the position of the neighboring elements and, in general, does 
not possess the property of maximum or minimum. 

However, unlike the extremals in class C, the extremals in class U 
should satisfy an additional condition on (a, 6), namely the first inequality 
in (S. 72). 
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The last two inequalities in (S. 72) define extremal boundary conditions 
for the determination of the integral constants C, and C,. Since édy(a)is 
arbitrary, the first of these inequalities is satisfied if one of the following 
three conditions holds true: 


1) W (@,y(@),1)—F,: (2,y(@), y' (a))=0; 


y(a)—a, >0,; 
2) W (a,y(2),1)—Fy: (a,y(2@),y' (2)) =0; (Ss. 72a) 
y(@)—a, <0; 


3) y(a)=a,. 


The second inequality defines analogous conditions at the right end. 
Thus, if the solution of Euler's equation is unique for each pair of 
these boundary conditions, this equation together with (S. 72a) and 
analogous conditions on the right end may produce nine type b extremals. 
The first two equalities in (S. 72a) constitute natural boundary conditions; 
they are meaningful if y(a)—a,>0 or y(a)—a,<0, respectively. The last 
equality in (S. 72a) follows directly from the first inequality in (S.72); it 
shows that the solution of Euler's equation for the boundary conditions 
y(a) =a, and y(6)=6, is also an extremal in U. 

c) It follows from (S. 67) that, in addition to type a and type b extremals, 
there are also mixed extremals consisting of pieces of type a and pieces 
of type b, with the functions y(t) and z(¢) suffering discontinuities at the 
points pi. 

In drawing up the expression for the first variation (S. 67), we used the 
comparison lines u* whose functions y(t) and 2(¢) are close to the original 
y(t) and z(t) everywhere, except at the discontinuity points np; We now 
enlarge the group of comparison lines by including (y, z) lines with the 
discontinuity point displaced in an e-neighborhood of wi. Using expression 
(S. 61) for the functional /(u) and comparing it with the functional along 
these lines, we obtain, as in the previous section, an additional necessary 
condition, which should be observed at the discontinuity points j,: 


S(pi, Ys 21) =S (ui Hi, Zi). (S. 73) 


Here the pairs of functions y(t), z(t) and y(t), z(f) are either one of the 
solutions of system (S. 69) or the solution of Euler's equation with z(t) =y’(t) 
and one of the boundary conditions at the discontinuity point tu, 


1) W(e,9,,1I)— Fy (b:5419;) =9; 
¥1— 91 > 0; 
2) W (u,91, —1)—Fy (PY, )=0; 


3) w= Hs 


(S. 74) 


considered jointly with one of the analogous conditions at the other end 
point. 

We have thus established that if the left and the right limits of 
f(t, y, 0,1) exist, and are different from each other, the functional (S, 13) 
may have extremals of two types, a and b. Type a extremals are analogous 
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to the (y, z) extremals of the previous section, and type b extremals coin-~ 
cide with the extremals in the class of continuous functions on (a, 6). 
There are also mixed extremals consisting of pieces of types a and b. 
Additional necessary conditions have been derived which the functional must 
satisfy in order to have an extremum in U, which coincides with a weak 
local extremum in the class of continuous functions C. 


1.4, Special cases 


We will now consider the most characteristic particular cases when the 
condition of existence of M(pi, y, ¥) enables us to fix the various dis- 
continuity points of the extremal ¥(!) and to determine its qualitative 
behavior. 


I. The functions f(t, y, 0,1) do not exist anywhere in region B in the (tf, y) 
plane. 


The function M(f, y, y) does notexistforall t,y,7C 8B, y*#¥. Theintegral 
(S.13) exists and is finite only onthe continuous curves from U. The minimal, 
if it exists, belongs to this category. The problem thus reduces to finding 
a minimum in the class C of continuous functions. Further investigation 
of this case requires application of the theoretical-functional techniques 
(1 —4) described in the introduction. 

An extremal in general depends on two parameters which are determined 
by the values of the function y(t) at the two ends, y(a)=a,; y(b)=6;. Under 
certain conditions, this is a smooth curve satisfying the Euler equation. 

Functionals with the functions f(t, y, +0,1) existing at a finite number of 
points {+,;.are close to this type. As before, the condition of existence of 
@(t, y, ¥) limits the class of admissible curves to curves from U on which 
y(t) is continuous everywhere, possibly with the exception of ft=p; ((=1, 
2,..., 8). The minimal, if it exists, also belongs to this set. 

The problem thus reduces to finding an extremum in the class of 
continuous piecewise-smooth functions and determining the extremal 
"matching" conditions of the continuous pieces at the verticals t=p;. We 
will confine our analysis of the functionals of this type to the derivation of 
the ''matching'' conditions and discussion of some of the corollaries. 

These conditions are determined by the properties of the function 
f(t. y p, 1) for p=0on the straight lines t=ypj. 


Il. The function f(t, y, p, 1) exists and is continuous on the straight line 


f=p,, p=0, Mo(v,)<y <T, (p,). 


Consider one such value ‘=p € (a, 6). The function W is independent 
of sign Ay, i.e., 


Ww y 1)=W(n, y —1) 


and we may write 
(2, y,y)= B(r,y,c)— 9 (24,0), (S. 75) 
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where c is a constant. 
By (S. 18) and (S. 75), 


J(uy=1M41, 


B 

1M=) F (t,y,y')dt — ® (p,y(p),c); (S. 76) 
a 
b 

12) = | F (ty,y')dé+ ® (p,y,c). 


p 


/(u) thus separates into two independent functionals. Therefore, its 
extremal is made up of two continuous pieces, which are extremals of the 
independent functionals (S. 76), joined by a vertical segment t=. Since 
M(n, y, c) is continuous together with its derivative, these pieces at f=)0 
satisfy natural boundary conditions of the form 


ad d 


dy 


F,'— _ ® (p,y,c¢)= 0; Py — ® (p,y,c)=0, 


ayo 
or 


Fy? (to, Yo. Yo) — W (Yo. Yo) = 03 
Fy" (io, Yos Yo) —W (Yo Yo) == 0. (Ss. 77) 


It is remarkable that each piece of the extremal depends only on the 
position of one of its end points, and is independent of the position of the 
other end. If the continuous pieces of the extremal are piecewise-smooth, 
they satisfy Euler's equation with boundary conditions (S. 13) and (S. 77) and 
Erdmann—Weierstrass conditions at the corner points. In the particular 
case W(fo, y)=0, conditions (S. 77) coincide with Razmadze's conditions 
for a fixed discontinuity point /13/. 

The results are readily generalized to the case of n discontinuity points. 
In this case, the extremal consists of n+l continuous pieces, and each 
i-th piece of the extremal coincides with the extremal of the functional 


Yu 


at Yind vi 
N= J F(tyy' dt J Wee ae— f Wy’) de (S. 78) 
c ¢ 


Pie} 


Each of these extremals is a curve dependent on two parameters. These 
parameters are determined from the natural boundary conditions at the dis- 
continuity points wi-1and wy: 

Fy (pin Ji-ws Yi) —W (4-1,91-1) =0; ) 
payee = : (S. 79 
Fy (ti yoy) — W (hoy) =0, i= 1,2, ... 2-1. 


For the first and the last piece of the extremal, the two parameters are 
determined from the conditions 


y(a)= a4; Fy (90.95) — W (Hos Yo) =0 (S. 80) 


and 


y(O)= by; Fy: (ba Yar Yn)—W (Yar Yn) =0- (S. 81) 
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If the continuous sections of the extremal function are piecewise-smooth, 
they satisfy Euler's differential equation for the functional (S.13) and the 
Erdmann—Weierstrass conditions at the corner points, A remarkable feature 
of the extremals is that, with the exception of the pieces adjoining the end 
points, they are independent of the values of the function y(t) at the ends 


of [a, 5]. 


Ill. The functions f(pi, y, p, 1) are discontinuous in p for p=0, but the 
limits f(u:, y, 0,1) exist and are continuous for y € [T2(yi), Ti (pi). 


Consider one such point 
f=po € (a, 6). 
By (S. 18) 


Uo 


I(u)=) F(t YO, 9 at + ® (Yo,yo.90) +) F (yry') at. (S. 82) 
0 


The function W(tto, Yo, Yo) in this case no longer can be written in the 
form (S. 75) and its derivatives Dy, and 07, are discontinuous at Yo—yo=0. 
These derivatives are expressed by the following relations: 


W’ (po, Yos 1) for yo yy >0; 
0, = Yo Yo > 9; 
ae LW tan —1) for yo— Yo <0; (S, 83) 
o- = {oP etd for yo— Yo > 0; 
% (—W(Po¥—1) for »—y<0.- 


We will only consider the case when the continuous pieces of the extremal 
are smooth. Then the first variation of the functional (S. 83) may be written 


in the form 
Bo 
1 


ts (w= \ [Fy Fy] Bud + Fe (Po don) 8¥0+ 


a 


6 
d ~ 7 _ 
+) [Fe—<e Fe | tudt— Fy (yor Ye) Puo+2® (Ho doie) (S. 84) 
By 


where 
W (0.40.1) 840 — W (2p, Yor 1) 840 


8® (9, Yo. Yo) = oe 
Hor YoYo W (9, Yo.—1) 849 — W (Hp, Yo ~ 1) 8Yo 
for Yo—Yo<0 Sam 


W (PorYo.1)(BYo— B40), if Byo—FYo > | for y,—g—O. 
W (Po, Yor— !)(Byo— bya)» if Byo By <0 
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The necessary condition of a minimum of the functional /(u) is 
6/(u) >O. (S. 86) 


This condition is satisfied if both pieces of the extremal satisfy Euler's 
equation 


Fy-— Fy =0, (S. 87) 


and one of the following conditions is satisfied at the discontinuity point: 


Fy! (os Yo. Uo) — W (os Yo. 1) = 0; 


ee fe (S. 88) 
F (2, Yo, Yo) — W (Yo.Yo + 1) =0; 
Yo— Yo > 9; 
Fy ‘y—W —1)=0,; 
y (Ho Yor¥o) W (Hos Ho 1) (S. 89) 
F y' (995 Yo, Yo) — © (tos Yor — 1) = 0; 
Yo— Yo <9; 
[Fy (YoYo 40) — W (Po Yo) [F uy Wo. %00%9)— W (0,40, — 1] < 9; (S. 90) 
Yo— Yo = 0. 


Thus, if Euler's equation (S, 87) is uniquely solvable for the boundary 
conditions (S.14), (S. 88), (S.89), and (S.90), the functional (S.13) may have 
three extremals in this case which satisfy one of the three conditions above 
at the point where the limit f(t y, +0,1) exists. Two of these extremals, 
specifically those satisfying conditions (S. 88) and (S. 89), are discontinuous 
at this point. Unlike case II, the continuous pieces of these extremals are 
no longer independent of one another, since their end points on the line 
t= are interlinked by the inequalities in (S. 88) and (S. 89); however, 
between the limits compatible with these inequalities, the different pieces 
are still independent. Condition (S. 90) shows that besides discontinuous 
extremals, a continuous extremal may also exist in this case, satisfying 
Euler's equation and the boundary conditions (S. 14). 

If there are n points p; (i=0,1,2,...,2—1) on the segment [a, 6] at which 
the limits W(;,y, 1) and W(ui,y,—1) exist but are different, the necessary 
extremum conditions are satisfied by any function y(¢) consisting of 
continuous pieces which satisfy Euler's equation, conditions (S. 14) at the 
ends, and one of the conditions (S. 88), (S. 89), (S.90) at the points pi 


(i=0, 1,..., n—1). In particular, one of such functions is the continuous 
function y(t) satisfying Euler's equation on [a, 6] with boundary conditions 
(S. 14). 


We considered the most characteristic cases when the existence of the 
integrals @(p:, yi ¥i) provides some indication of the qualitative behavior 
of the minimal and fixes the position of the discontinuity points. In addition 
to cases I—Ill, there may be mixed cases when for some {=p the conditions 
of case II apply, whereas for t=pe2 the conditions of case III]. In some cases, 
only one of the limits f(p, y, +0,1) may exist on the vertical f=uo, e.g., 
f(u, y, +0.1). In this case, the condition of existence of M(p, Yo, Yo) shows 
that the minimal j7(t) at f=po may only display a positive jump satisfying 
(S.88) or not jump at all, i.e., yo—jo= 0. 
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1,5. Examples 


Example 1. Consider the extremum of the linear functional 


b 
1 =f [P(ty)+Q(ty) yl at, y(a)=a,, y(b)=b,. (S, 91) 


This example is of independent interest because of numerous applications. 
We have 


1 : 
Ft y,0.1)= lim p[P-+Q——|=Q(t, y) 


W (Z,y)=Q (4,9); 
(S. 92) 
S=P(t,y)— { Q, (4,2) dé. 


c=const 


For the functional (S. 91) to attain an extremum on the function y=y(t), 
it is necessary and sufficient for the function of a single variable S(t, y) to 
have an appropriate extremum for every fixed ¢ C[a, 6]. The equation of 
the extremal is 


Sy= Py(t, y)—Qlt, y) =0, 1 € (a, 6). (S. 93) 


We thus obtained the necessary and sufficient condition of extremum of 
a linear functional (S. 91) in the class of curves with vertical segments. 
The extremal, in general, consists of vertical segments t=a and t=6 and 
the curve (S. 93) and belongs to the set U. 

Equation (S. 93) also can be obtained by the method of variations, in the 
form of degenerate Euler's equation. Using Green's theorem, we can 
derive the sufficient conditions for a strong local minimum on its solution 
y(t). It coincides with the condition of local minimum of S(t, y) in the 
neighborhood of y° for every fixed ¢. Ifequation(S. 93) has several solutions, 
the absolute minimal consists of pieces of these solutions and pieces of 
the boundary, joined by vertical segments. 

The construction of the absolute minimal from these pieces by the 
method of variations combined with Green's theorem is not a simple 
process. Our method provides an attractively simple solutions to this 
problem: using (S. 92) we construct the function S(t, y) and for every fixed 
tC [a, 6] find the value of y(t) on which S(t, y) attains it least value on 
the segment I'2(t) <y<TI\ (4). 

The fact that S is independent of z signifies that there are infinitely 
many minimals % in U9. This category includes any (y, z) line &@ © Up with 
a zero closeness function ¥(f) constructed by the above method for an 
arbitrary (finite) local slope z(!). In other words, any sequence of polygonal 
lines {y,} ( U satisfying the condition 


yi=¥7(tit0), Jy) |<M, i=0, 1,..., n—I, 


where M>0Ois any number, is a minimizing sequence. 
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Let P=Q=ly. The equation of the extremal is t—y=0, ¢€ (a, 6). Further, 


2 
S=ty— es Along the extremal S= > we have 


Therefore, along a polygonal line consisting of the vertical segments 
t=a and ¢=6 and the straight line (S. 93) the functional attains its absolute 


minimum. The extremal is unique. 
Example 2. (Razmadze). Consider the extremum of the functional 


b 
1=J sin(yy')dt; y(a)=ay; y (6) =), (S. 94) 


a 


The following conditions are satisfied: 
F (6, y, 9,1) =lim p sin (y —) exists and is continuous; 
p-+0 P 


W (y) =0; 
S(t,y,z)=F (t,y,z) =sin (yz); 


Sinax= 1; Simin (Ys2) = —l1 


for any y and Z, so that we obtain the following equations for the two 
families of extremals in Up: 


yz=— +t 2k, yy’ = ——>--+ 2ka, (S. 95) 


k=0,-+1,+2,... 


The equations of the pieces of the extremals belonging to U are 


yy = + Pha, yy’ = —— 4 2kx, 
Integration gives 
y= (2k +1)nt-+C,; y2=(2k—1)at+C,, (S. 96) 
k=0,+1,+2,... 


Since the function S is independent of C;, any extremal may have dis- 
continuities at any point ¢,€ [a, 6]. 

The functional (S. 95) thus has two families of extremalsin U, consisting 
of pieces which satisfy the first and the second equation in (S. 96), with 
arbitrary C;, and have any number of discontinuity points with arbitrary 
abscissas. On the first of these two families the functional (S. 94) attains 
its absolute maximum, and on the second family its absolute minimum. 

Example 3. Minimize the functional 


b 
Hy)=J ll VF gat ‘eu 
y(a)=a,>0, y(6) =), >0 
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(the problem of the least surface of revolution). Here 


1 
Why zN=lim|ylY 1+ B= tly) 


where W depends on the sign of p. Therefore, the functional (S. 97) 
corresponds to type II of the general case. 
Further, 


W(y, signe) -e=|y-e]>0, 


i.e., the functional (S, 97) satisfies the necessary condition of the existence 


of extremum (S. 68), 
Let us find a type a extremal. We have 


S(44.2)=|y|V1+2?—|y|zsign(y’ —z); 
S(y,z)=|y|V 1+2?—zsign(y'’ —z). 


(S. 98) 


We use the necessary and sufficient condition of a local extremum (S. 49), 
which is also valid for functionals with a discontinuous derivative F,, such 


as (S.97). We have 
Smn(y, 2)=S(0, z) =0. 


The equation of the extremals is 
y(t) =0; ax<t<b (S. 93) 


for any 2. Since zis arbitrary, there exist infinitely many type a 
extremals in U,. This is a reflection of the fact that for any curve y*(t) #0, 
|y*(t)|<n, where n is sufficiently small, there is always a polygonal line 
with arbitrary section slopes 

y, =2(t;) and y;=0 (i=0,1,..., x), 


where n is finite, such that /(y,)</(y*). Among these extremals, there 
is one which belongs to U, specifically 


y(t) =0; z(t) =y’ (t) = 0; a<t<o. (S. 99) 
This extremal is unique. It is continuous on (a, 6) and has discontinuities 
at t=aand t=). The surface of revolution of this curve comprises two 


disks of radiia,and6,, joined by a tube of zero radius. The minimum value 
of the functional (S. 97) is obtained from (S. 40): 


o b, 
Imin=$  Syin(ys2)dt + J yi sign(b—y(6))dy— 


= j |y| sign(y(a)—a,)dy=— (a3 +63). 
c=0 
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Letus now find a type b extremal. For a curve to be such an extremal in 
U, it is necessary and sufficient for it to be an extremal in the class C of 
continuous functions on (a, 6), and for the functional to satisfy the first in- 
equality in (S. 72) along this curve. 

Let us check this condition. 


[W (¢, y, signe)—Fy-Je =| [ie a | 0. 
y lyl | Vinay? a 


This condition is satisfied identically, and therefore all extremals in C 
are at the same time type b extremals in U. The extremals in C satisfy 
Euler's equation, whose solution in this case is described by the catenary 
t{— Co 


y=C,ch (S. 100) 
where C, and C) are integral constants. These constants are determined 
from boundary conditions (S. 72), which in this case take the form 


=0; y(b)=0; 
ee ” ; (S. 101) 
y(@)=a,; y(b)=4,. 


Three pairs of boundary conditions (S,101), together with (S.100), yield 
the extremals b, b,, bz, which are shown in Figure S.2. The fourth pair 
y(a) =¥7(b) =0 when inserted in (S. 100) leads to equations which are unsolvable 
for C, and C,. This is a reflection of the existence of an extremal which is 
not a catenary, i.e., does not satisfy Euler's equation. This is the previously 
considered polygonal line ACDB, a type a extremal, which is continuous and 
differentiable on (a, 6). 


B 
y A 
c 
br 
by 
C 4 Bx 
FIGURE S.2 


Finally, let us find mixed type extremals. Since the function S(y, z, 
sign(y’—z)) is everywhere zero on the ¢ axis and is independent of z and, 
moreover, y=£0 for all S>0, condition (S. 73) is satisfied if and only if 


yi=yi—0. 


Thus, in addition to type a and b extremals, there are infinitely many 
mixed type extremals (type c extremals) in Figure S.2. However, only two 
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extremals satisfy the sufficient conditions of minimum: the type a extremal 
by (S. 49) and the type b extremal which is continuous on [a, 6]. The absolute 
minimum is attained on one of these curves, depending on the relative 
position of the points A and B. 

Example 4. Find the extremal in U of the functional 


+1 
I(u)=) #y'dt; 
2, (S. 102) 


y(—-1)=—1; y(4+1)=s1. 


We have 0</(u)<oo, so that we need only consider a minimum. The 
function W(t, y, +1) does not exist anywhere on [—1, 1], except at the point 
t=0, where 


W(0, y, l)= W(0, y, —1) =0, 
i.e., we are dealing with a type II functional. Therefore, the extremal of 


functional (S. 102) consists of two continuous pieces joined at t=0 by a 
vertical segment. The left piece is the extremal of the functional 


Yo 0 
fy!?dt +. i} W (0,y)dy= i Pyat 


-l 


oa 
I 
Jere 


for y(—1) =—1 and the right piece is the extremal of the functional 


1 Yo 1 
I=) Pydt+\ W (0,y)dy= f Py’at 
9 ¢ 0 


for y(l)=1. 

Each of these functionals if positive definite and vanishes for y’=0. 
Hence, their minimum is attained on straight lines parallel to the ¢ axis 
which pass through the points (-1, -1) and (1,1), respectively. The 
extremal uy) consists of these straight lines and a joining straight segment 
t=0, 


I (uo) =0. 


Example 5. Minimize the functional 


b 
= ‘)+sin y'] dt; 
J ty)+siny']at; (S. 103) 
y(a)=a,; y(b)=6,. 
We have 


F (ty, £0,1)= lim + [y(14-p)+sin p]=y, 
p++0 Pp 
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i.e., [(t,y,0, 1)? exists and is continuous. Moreover, 
W = y 
S=y?+sinz; 
Snin(y,z)=S (0,21m——3-)= a1; 
m=0,1,2,... 


The functional (S. 103) has countably many minimals {im}@Uo. They all 
have a common zero closeness line 


y(t) = 0; a<t<o (S, 104) 
with different local slopes 
Zu (t)= 20m — (S. 105) 


This set contains no lines from U. 


§S.2. FUNCTIONAL ON THE SET OF FUNCTIONS 
WITH A BOUNDED DERIVATIVE 


The above results can be applied to the fundamental problem of minimizing 
the functional 


1(u)={ F (ty,p)at (S. 106) 
a 
under the constraint 
y'=8(4,y,P); (S. 107) 
IpI<1; (S. 108) 
y(2)=ay; y(d)=b,. (S. 109) 


It is assumed that F(t, y, p) and g(4 y, p) are continuous and have continuous 
partial derivatives for any ¢ and y for jp|<1. It is moreover assumed that 
for these 7, y, p 


og 
ape >0 (S.110) 


and the functions g(é, y, 1) and g(t, y, —1) retain a constant sign. 

Our problem thus can be stated as follows: among the pairs of functions 
y(t), p(t) satisfying conditions (S.107), (S. 108), (S.109), find a pair y(t), p(t) 
on which the functional (S, 106) attains its minimum value. If y(t) is given, 
equation (S, 107) uniquely defines the function p(t)=g-'(t, y(t), y’(t)). The 
problem is therefore equivalent to minimizing the functional 
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ruth Fanyyat (S.111) 
a 
y(a)=a,; y(b)= 
on the set of functions y(t) with a bounded derivative 
a(t, ¥, —1) Sy() <a (ty 1). (S. 112) 
Here 
Fityy)=Fit y ett yy’). (S. 113) 


Let us elucidate some properties of the set of comparison lines, which 


is designated U?: 

1) By (S.107), (S.108), and (S.110), the function y(t) is continuous along 
the line u € U? and p(t)(and hence y’(t)) is bounded and may have dis- 
continuities of the first kind. 

2) The line uCU? belongs to a closed simply connected region G in the 
(t, y) plane whose upper boundary y=I2(t) corresponds to the solutions of the 
equations 


y’=ag(t, y, 1) and y’=g(t, y, —1), 


respectively, passing through the points A(a, a,) and B(6, 6,) and the lower 
boundary y=I\(¢) to the solution of the equation y’=g(t, y, 1) passing through 
B and the solution of the equation y’=g(t, y, —1) passing through A. 

3) We impose an additional restriction on U?, namely that p(f) is 
piecewise-smooth and may only have a finite number of discontinuities 
of the first kind, and consequently may only contain a finite number of 
sections with p=1 or p=—l. 

Let (é.2) 

¥= (2,7); 
pean) | ene 


be the general integrals of the equations 
y'=e(t y, 1) 
and 
y’=g(t, y, —1), 


respectively. Here t and T are integral constants. We have 


9 =e (6y0, 
S115 
= Fy 0): reat 


In virtue of the above properties of the function g(t, y, p), only curve 


y=@(t, t) or y=(t, T) passes through each point fo, yo in G, i.e., to every 
point (f, ¥)) corresponds a single pair of values t and 7. 
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Consider a pair of piecewise-smooth functions y(t) € U? andg(t), 
where |q|<1. We construct the following sequence of polygonal lines 
{yn} C Ue: the segment [a, 6] is partitioned into 7 intervals by the points 


a=l<hy< we Shah = 6. 
We take 


Yn (ti) =y (ti) (S.115a) 


and through each point (4, y(ti)) pass a pair of curves y=9(f, ti) and 
y=v(t, T;) which together with the pair y=@(t, ti41), y=p(h Tizi) on (ti, C141) 
define a region G; of the admissible values of yn(t), which is constructed 
along the same lines as the region G. For all (fi, ti41:) we take 


Pn= q(t) (S, 116) 


and yn(t) is correspondingly set equal to the solution of the equation 
y’=g(t, y, g(t)) for those t € (t;, ti41) where this solution belongs toG;, and 


Yn(t) =o(t, tii) or yn(t) =p(t, Tis) (S.117) 


for all other ¢ € (ti, fit1).. The choice of equality (S., 117) is determined by 
the condition of continuity of Yyn(t). 
We say that the sequence {yn} approximates to the line u € U? or {yn}—-u 
and correspondingly {yn(t)}—uif ne 
Noe 


y'(t)=ge(4 y 9) (S. 118) 


almost everywhere onfa, 6], and for n—ovo, 
ren 2 ees (S. 119) 


If the pair y, q does not satisfy (S.118), we say that {y,} approximates to 
a (y, q) line 4 CUP, or {yn}—u; y(t) is the zero closeness function of the 
line uo, and q(t)is a local value of the parameter p. 

We take 


/(uo)= lim /(u,). (S. 120) 
{Tn} He 
Theorem 1. The functional /(uo), 4% C UP, exists and may be re- 
presented in the form 


6 
1 (wo)= [F (69, DLP 6.9.9) — F (fy, © DI} X 


a 


ee eet (S.121) 
&(ty.q)—e ty, +1) 
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where 
+1=sign[y’—g(t y, 9)]). 


We have 


n—l 


1)= & LF (t,9n9 Mt —t)+F [fy sign (yrs — 


—y (4) 141-4) +0 (44), (S. 122) 
where 


H=H9(h); G=G(ti); Ati=tyi—t, 
i, € (ti, tig1) is the abscissa of the point where the straight line 
9=Yit¥ (tis Yu Gi) (tt) (S. 123) 
meets the boundary of G;, i.e., intersects the curve y=q(t, ti+;) if 


Yini—y (Ei) >0 and the curve y=p(t, Tisi) if yiti—y (ti) <0. 
We have 


fot gv i—altiyn 4 gay), 
Meeting —~eltiyin B11) 


. = F (S. 124) 
sign |yi41— y(t) = sign [ys41— 9; — 8 (li Yar 91) X 


X Atj|=sign (y(t) — e(ti.92.9;) +0 (at). 
For sufficiently small Aj,, 
signlyi+i—y (¢1)]=sign[y’ (4) —g (ti, ys, 9:)], (S, 125) 


if 
Y (ti) Fe (ti, Yi, Ji)- 


From (S.122), using (S. 124) and (S.125), we find 
a—l 


Lvd=Y) (Fr 4.9) Fltignt MN X 
i=0 


yp —etign $1) 


ei NE, Rar } t, +o(Aat)). : 
CRT eri irra) ae (Fiyerk Dy Af +0 (Ah) nde) 


The argument +1 in these expressions stands for 
sign[y’(t:)—g (ti, ya 9i)]. 


In virtue of the properties of F(t, y, q), y(t), andq(t), the function 


F (¢,y,9)—F (ty, £1)) “Meo £4 ee y(t),-+1) (8.127 
DE eee omMaseeean ee eee 
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is bounded and continuous almost everywhere on fa, 6]. Therefore, by the 
Lebesgue theorem, it is Riemann-integrable. The limit (S. 120), where 
I(yn) is defined by (S.126), therefore exists, is equal to (S.121), and does 
not depend on the choice of the sequency {yn}. Q,E.D. 
Corollary 1. If y’=g(t,y,q), the integrand in (S,121) is equal to 
F(t, y, g) and, therefore, definitions (S. 106) and (S.120) coincide on U?, 
Corollary 2. Since y’ enters the integrand in (S. 121) in linear form 
for y’—g(t, y, gq) 0, and does not appear in the integrand at all for 
y’—g(t, y, 9) =0, the functional (S, 121) is continuous on Ufin the sense that for 
any given e>0 there exists 6>0such that 


[1 (u)—I (iz) |<e, (S. 128) 


if ly—y|<n, |g—-F|<6 almost everywhere on [a, 6]. According to the lemma, 
the minimals in U? and U coincide, and therefore instead of minimizing the 
functional in U2 we can minimize it in the class U2 of (y, q) lines, using 
expression (S.121) for J(u). However, expression (S. 121) in its original 
form is not particularly convenient for the determination of the absolute 
minimum, since it contains the derivative of the zero closeness function y’. 
We will adopta different approach and try to establish a relationship between 
this problem and the problem of minimum of /(up), u © Uo. To this end, 
we use (S. 31) and Theorem 3. Along any line u CU?, the equation y(t) = 
=(t, t) defines a piecewise-smooth function t=f(t) which has dis- 
continuities of the first kind atthe points t=y; (i=1, 2, ..., n) corresponding 
to the sections p(t) =1 of the line uw, if any. Since each t corresponds to a 
single ¢, the variable t can be chosen so that ¢(t) is an increasing function, 
i.e., 


a 0 (S. 129) 
dt 
on the smooth sections and 
t(t+0)—#(t+—0) >0 (S. 130) 


at the discontinuity points. 

The functional /(u) may be considered on the set of the functions y(t), 
and not on the set of y(t), if we make the following substitution of variables 
in (S. 106): 


=) (t, y); 
Pie 8 (F141) (Ss. 131) 


Tt, 


at= 
8 (91.4.1) — 2 (91.4.7) 


where gi(t, y) is the inverse of 9(t, tf). Since g(t, y, 1) maintains a constant 
sign, it exists and is continuous and differentiable. 
We have 


aml iy) n 


I(uy=> j Fy(t,y,p)dt+ > ® (p,, Yt, Yi); (S. 132) 
f=) p; fet 


19] 


. 9128 (91591) 8 (P14) 
=g,(t,y,p) = ee eee S. 133 
Y= 81 (T YP) £ (91,951) —& (41-9157) ( ) 
where 
) =A@=T(a,a,), Ha = P= 1 (8,0,); (S. 134) 
9428 (F154 01) 
Fi =F (¢,y, 8.135 
: (#104?) £91,451) — 8 (91.45) ( ) 
t y 
~\ Fir (t,&),€,1] 
® (1,4, =f F (t,o (t,t), 1)dt==( 1 9)8 71 oe. S$. 136 
(ay=j PGREDD erererrar 
t ¥ 
y= tt. (S. 136a) 


Consider the set Ut of piecewise-smooth lines on which the function 
y(t) is single-valued everywhere, except a finite number of points t=, 
where y(t) may have discontinuities of the first kind. In the (t, y) plane, 
the set U* is a perfect analog of the set U in the (t, y) plane, as introduced 


in Chapter II. 
We have U? (C U*. Let uC U*. This line belongs to UP if 


P(t)> —1; 

y (4s + 0) —y (4, —0)> 0. tote) 
The condition p(t)<lis satisfied automatically, in virtue of the particular 

choice of the independent variable t. The vertical segments of the line u 

in the (t, y) plane, t=yi, are the solutions of the equation (S.107) for p=1. 

It follows from (S, 133) that for y—oo, we have p—l. p(t, y, y) can be 

expressed from (S.133). Inserting the result in (S. 135), we obtain the 

function F(t, y, y). We have 


F (#1 9,1) 


W (t,y)=lim F; (t,y,y) —= 
(t,y)=lim F; (t,y,y) Fa 


1 
ire y 


(S. 138) 


The functional /(u), a@U*, is defined by (S.132). Since the integrand in 
(S. 136) coincides with the limit (S.138), and this limit does not depend on 
the sign of the difference yi—y: this definition coincides with Definition 1 
from ¢S,1, and the functional /(u), u@ U*, corresponds to type I of the 


F 1 : : : 
general case, when the function pF; (= y, Pe exists and is continuous for 


p=0. Therefore all the theorems formulated for this case still apply to this 


functional. 
We introduce the set of (y, z) linesU3. Here Z is the local slope of the 
line up C Uj in the coordinates t, ¥. To every z corresponds some g defined 


by the equation 


2=61(t, ¥, 9). (S. 139) 


This is a one-to-one correspondence, since 22 >0 for z=oo. Since 
q 


£1(t, y, 9) is continuous, we have z—2) ifg—+qo, and vice versa. The line 
uy CUZ may therefore be defined either by the pair of functions (y, z), or 
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by the pair (y, g). We have referred to the function g as the local value of 
the parameter p. By Theorem 1, we may write 


8 
I (1) = J S(x,y,2)dt + ® (8,b,,c)—  (a,0;,¢); 
= (S. 140) 


y 
S=F,(1,y,2)—W (x,y)e— | W(x, ah, 


where ¢ is an arbitrary constant, or, using (S. 133) and (S, 135), 


B 
I (ig)=\ S,(x,y,9) dt + © (8,,,c)— © (a,a2,,0); (S. 141) 


Sy oy, PY BE HW NFP 8 Pr 7) 
(91 (Ty). 45!) —@ (91 (4,4), 4 59) 


y 
Fr(tiy lg (ey 1) —F (eisy sl) oe (Piy Dd 


—f a(t, ¥) 82 (91 (t.y),4 01) 


ec 


dy. (S. 142) 


The theorems of §S.1 ensure a complete solution of the problem of 
minimizing /on U*. To apply these theorems to the present case, we have 
to establish a relationship between the sets Uj and U? ¢ Uf, This relationship 
is established by the following lemma. 

Lemma 1. Consider a (y, q) line u CU3. A necessary and sufficient 
condition for the existence of a sequence of polygonal lines{yn}—uo; {yn} CU?, 
i.e., a necessary and sufficient condition for uC U2, is that the functions 
q(t) and y(t) satisfy the constraints 


q(t) >—1; (S. 143) 
y—#i(t, ¥, 9) BO (S. 144) 


at the points of continuity of y(r) and the constraint 
y(ui)—¥ (wi) DO (S. 145) 


at the points of discontinuity pi of y(t). 
Necessity. 1) By (S.137), the sequence {y,} approximating a (y, 4) 
line u C Uj belongs to U? if 


q(t)> —1, 
a S. 146 
yi— yr > 0. ( ) 


(S. 143) follows directly from these inequalities. 
2) Let at the point t€ (a, §), 


¥(t)—ei(to, ¥(to), 9 (to) ) <0. 


Let yn be an approximating polygonal line sufficiently close to u, and T; 
and Ti+1, t<toSti41, are the partition points closest tot. We have 
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Yrs Fis = Yier Gt — In (TW) AT = 
=Ly (To) — & (Tor ¥ (To) J (To) +0 (AT,)] AT}. (S, 147) 
Here t* C[ti, titi], Ati=ti41—ti. Choosing n so that At, is sufficiently small, 
we find 
Yin1— Viti <0. 

This signifies that (S.137) is not satisfied at the point tii1, i.e., for 
n>N(Ati), the sequence {yn} does not belong to U?, Thus, (S. 143) and 
(S. 144) are necessary conditions. The necessity of (S. 145) is self-evident. 

Sufficiency. Let(S,143), (S.144), and (S.145) apply. The condition 
pS! is satisfied automatically on the entire set U* (and therefore on U? also) 
because of the particular choice of the independent variable tT. Construct 
a sequence of polygonal lines {yn} approximating a (y, g) line uC Uj. We 
will show that forn>N, {yz} CU. To this end it suffices to show that 


(S. 146) is satisfies for n>N. The first condition is true by (S.143). Now, 
let the zero closeness function y(/) be continuous and differentiable on 


(tit, Ts): 


Ai i 
4—- HN Yi —§ gat =| (y—81(t,y,9)) at. 


re) "1 
By (S. 144) 
y—ei20, 
so that 
yi—YiD0. 


Let now y(t) have a discontinuity on (t;-1, ti). For sufficiently large n 
(small Atmax), this is the only discontinuity on (t+), ti). Then 


41 —4 =y(p) —y(v)+0(At,), 


where uw is the point of discontinuity of y(t). By (S.145), for sufficiently 
large ”, 


yi—yiz0. 


Thus, for n>N, (S.146) is satisfied for all i=0, 1, ...,. n—l, m. Hence 
fyn} (U2. Q.E.D. 

A similar lemma for the lines u € U7 is proved by replacing (S. 143), 
(S.144), and (S, 145) with the following conditions: 


I<); (S. 148) 
d 
ap bull y,9) <0 (Ss. 149) 
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at the points where y(¢)is continuous, and 
y(T;+0)—y (T;—0) <0 (S. 150) 


at the points T; where y(T) is discontinuous. 
Let 7, g be a pair of functions satisfying the condition 


Si(4,y, N= inf S(1,y,9) (S.151) 


P(t) <y <P, (t);—-1<q<l, 
and y*, q* a pair satisfying the condition 


S2(T,y*, gq") = inf S2(7,y,9): (S. 152) 
T,<y<Ty—l<qsl. 

Here US, g2(T, y, 9), S2(T, y, 9) are the analogs of Uj , gi, S,; in the coordinates 
T,y. 

We now prove the following theorem. 

Theorem 2, Let the (y, q) line # C Uj defined by the pair 7, qg satisfying 
(S. 151) satisfy conditions (S. 143), (S,144), (S.145). Then @ is the absolute 
minimal of the functional (S, 106), i.e., 


1 (w)= int J(u), (S. 153) 


It follows from expression (S.141) for /(u), uC Uj,that @ is a minimal 
on the set of the (y, g) lines uC Uj satisfying the additional condition g@—1. 
Since this set encloses U”%, we have /(i) </(u),4C U?. 

If & satisfies (S, 144) and (S.150), there exists a sequence {yn}— i, 
fyn} CU". Thus, by definition of the lower bound, 


/ (a) = inf / (a). 
ueu? 
Q. E. D. 
A similar proof can be given for Theorem 2*, 
Theorem 2*, Ifa line u* € U? defined by the pair y*, q* satisfying 
(S. 152) satisfies conditions (S,149), (S.150), we have 


1 (u*)= inf J(u). (S. 154) 
a eu? 


Corollary. Let the line u* € U{ defined by (S. 152) satisfy (S. 148), 
(S.149), (S.150), i.e., u* CUP, Then, if /(Z)s4/(u*), where /(#) is defined 
by (S, 141), the line 2 does not satisfy (S. 144), (S.145), i.e., 2 does not 
belong toUg. The reverse proposition is also true. 

Indeed, suppose that these conditions are satisfied on i € Uz. Then, by 
Theorems 2 and 2*, /(u), uC U?, has the lower bounds /() and /(u*), in 
contradiction to the definition of the lower bound. 

Theorem 3. Let the functional (S.106) have a minimum on the (y, q) 
linea € Up. Furthermore, let the following inequalities hold true almost 
everywhere for the pairs j, J and y*, g* defined by (S, 151) and (S, 152): 


dy ~~ 
yee a t, ’ <0; 
rks att99) (S. 155) 


fy" ge ty* gt 
1 8 I") 0. 
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The minimal is then a continuous piecewise-smooth function. 

Since z € UP, we see that 7(#) and g(t) are piecewise-continuous and, 
therefore, wz may consist only of a finite number of sections on which one 
of the following three conditions is satisfied: either 1) y’(#)—g(t, 7, 7)>0, 
or 2) 9’(t)—g(t, ¥, 4) =0, or 3) W()—e(t 9, 7)<0. 

By the remark to Theorem 1, condition (S, 151) should be satisfied on 
those sections where y’—@(t, ¥, 7) >0, i.e., 7=% and g=g, but this 
contradicts (S.155). Consequently, such sections do not exist. Similar 
argument proves that sections of type 3 do not exist either. Therefore, 
we have almost everywhere on [a, 6] 


¥()—e(t 7, 7) =0, (S. 156) 


i.e, BEU?. 

Corollary. Since the minimal uw is piecewise-smooth, it can 
be determined by ordinary classical methods, e.g., the maximum principle. 
The minimal consists of a finite number of Euler pieces and pieces having 
the limit direction p(t) =]. 

Theorem 4, Let [m, t2]€ [o, B] be an isolated segment on which the 
functions §(t), G(t) obtained from (S,151) satisfy conditions (S. 144), (S. 145). 

We construct the following object wu: 

1. On [a, 1], the minimal @ coincides with the absolute minimal of the 
functional 


N(any=J Fi (ty,p)dt—® (ty, yi), (S. 157) 


with a free right end yi=y(t)). 
2. On (1%, tz), the object w is a (y, q) line: 


y=y(t); 9=9 (1). (S. 158) 


3. On [t2, B), the object @ coincides with the absolute minimal of the 
functional 


8 
Io(tt,y2)=$ F,(t,y,p) dt + ® (Tp, yo), (S, 159) 
with a free left end ye=y(t2). 
If 
n<y(%)s 
aes (S. 160) 
Yo> y (Tr); 
the object ¥ is the absolute minimal of the functional (S.106), i.e., 
/(u)= inf / (uw). (S.161) 


u eu? 
Proof. The functional /(z) may be written in the form 


1 


I@)=) Fde— O(n. 9) +f S(epdaet 
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8 ty 
+) F dt + ® (1, Y)= 1 (41,91) + 12 (Way) + i) S(t,y,9) at. 
Similarly, along any line u CU?, 
J(u) =1,(te,43)+ 1a Hasyn) + J Syn) at. 


By the conditions of the theorem, 


N@py)—h (41,41) 
1, (42a,Y2) — 1 (2,40) 


>0 > 
> 9 


> 


{ S(uy.pyde—J S(x,9,9)dt=] [S(x,y,p)— S(e.y,D)ldt > 0 


and therefore 
I(u)—/ (az) D0. 


On the other hand, by the conditions of the theorem and by Lemma 1, we 
see that there exists a sequence {y,} ¢ U? such that /(yn)—/(2z). Therefore, 
N>oo 


in virtue of the definition of the lower bound, (S.161) applies. Q.E. D. 

Remark. If t,=aor t,=—b, (S.160) is replaced only by the first or the 
second inequality, respectively. 

Corollary. If(S.160) is now satisfied at the points t; and tz, the 
extremal pair (¥(t), 9(t)) does not necessarily coincide with §(t), g(t) on 
{t1, ta], but some of its properties emerge from Theorem 3. 

1. If there exist at least two points t=§, t=£ where 


y(t) =§7(t), (S. 162) 


for t E[E1, &] we have y= and g=@. 

2. If condition (S. 160) is not satisfied at a single point, Ti say, we have 
9(t) SH (rt) for tC [u1, w). If at least one root & C [m, 12] of equation (S, 162) 
exists in this case, for EX<rt<12 we have y=Jy, @=@. An analogous theorem 
clearly can be proved for an isolated segment [f,, #2] on which the functions 
Z(t) and §(t) defined by (S. 152) satisfy (S.149) and (S.150). Inequalities 
(S.160) should reverse their sign in this case. 

Discussion. 1. It follows from the theory that the functional (S. 106) 
on the set U? of functions with a bounded derivative has (y, g) minimals with 
a "branched" derivative, similar to the (y, z)minimals of the simplest 
functional. The role of the vertical directions in this case is assumed by 
the directions p==+l., 

2. As for the simplest functionals of type I (the function pF (« y, ~) is 


continuous for p=0), the solution of the variational problem should not start 
with the solution of Euler's equations, i.e., we should not attempt to find a 
weak local minimal on the class C,. A better approach is to set up the 
function S,(¢, y, g)(or So(z, y, g)) and to find its minima §(t), G(t) for every 
fixed tC [a, BJ. If the functions §(t), ¥(t) satisfy the conditions of Theorem 2 
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(Theorem 2*), this completes the solution. The functions f(r), (1) define 
a (y, q) line Z CU? on which the function (S. 106) has an absolute minimum. 
If almost everywhere on [a, §], 


G2 (tu, J, %) =0, 


we have i CUP. Otherwise, the pair §, § defines a minimizing sequence 
{yn} CU?. For any given e>0, there exists N such that for n>N, 


|1(u) —I(@) |<e, 


where u is any line from UP. 

3. If the conditions of Theorem 2 are not satisfied on @ and u*, 

Theorem 4 enables us to identify pieces of the absolute minimal on those 
segments where the corresponding conditions hold true. To obtain a complete 
solution of the problem in this case, we have to find the pieces of minimal on 
those sections where the conditions of Theorem 4 do not hold true. These 
pieces coincide with the minimals of the corresponding functionals (5S. 157) 
and (S, 159) defined on the corresponding sections. 

4. Ifthe lines # and u* satisfy the conditions of Theorem 3, the minimal 
® of the functional (S. 106) is piecewise-smooth when & € UPand it 
can be found by conventional classical methods, e.g., by Pontryagin's 
maximum principle. 

5. If we compare the results with those of § S.1, we see that the functionals 
(S.106) with regard to their extremal properties are the closest to type I 
functionals of the general case. 

The strongest direct expression of this analogy is provided by Theorem 2. 
However, unlike the type I functionals, functional (S, 106) may also have 
smooth Euler minimals, if the conditions of Theorem 2 (Theorem 2*) are 
not satisfied. In this respect, functional (S. 106) is closer to the simplest 
functional of type IJ of the general case{ f(t, y, p, 1) has a discontinuity of 
the first kind for p=0). There is a highly significant difference between the 
two types: for type II functionals the minimum of S along the (y, Z) line up 
is only a necessary condition of a minimum of /(u) on the line us, and in the 
presence of this minimal the absolute minimum may be attained on an 
ordinary Euler extremal, whereas for the other type, as for simplest 
functionals of type IL the minimum of S ona (y, g) line u C UP is a sufficient 
condition for the absolute minimum of the functional on this line. 


§S.3. OPTIMAL PROGRAM FOR HORIZONTAL 
FLIGHT OF AN AIRCRAFT 


We will now consider the optimal thrust control for the horizontal flight 
of a jet aircraft over a maximum range. This problem was dealt with by 
Hibbs /15/ and by Miele and Cicala /14/, According to their results, 
assuming a linear dependence of thrust on fuel consumption, 


P=jp, (S. 163) 
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where P is the thrust, B is the fuel consumption, j= const is the effective 
nozzle velocity, we can reduce the original problem to extremizing a 
functional of the form 


flaw, m)+V’/BYV, m)] dm, (S. 164) 


where m is the instantaneous aircraft mass, V is the aircraft velocity, 


wat Euler's equation for this functional degenerates to a finite equation 
m 
od 2. Ry, (S, 165) 
ov om 


which in the general case does not pass through the given initial and terminal 
points (Vo, m7), (Vi, m,) in the (m, V) plane. 

An ingenious application of Green's theorem enabled Miele to construct 
the sought solution and to show that it is made up of pieces satisfying 
equation (S, 165) or the conditions B=fBmaxand B=0. For the case of a non- 
linear dependence P=P(B) the solution is no longer degenerate and the 
method of Lagrange's multipliers must be applied. 

In the present section, this problem is solved by a different mathematical 
approach, namely by the theory presented in § S.1 and §S. 2. 

A quite general dependence P=P(B) is assumed, As in /15/, we seeka 
dependence V=V(m) ensuring the maximum range for given values of to, Vo 
and m,, V;. It is shown that this problem of maximizing a functional can be 
reduced to maximization of a function of two variables S(m, V, 8) for 
every fixed m € (mo, m,), where V and 6 are assumed independent. The 
optimal program obtained in this way is found to be degenerate, i.e., in- 
dependent of the position of the end points (7m, Vo) and (m:, V;:).. In case of a 
linear characteristic P=jB, j= const, Sis independent of f and the solution 
V=V(m) coincides with that from /14,15/. If P=P(B) is nonlinear, the 
absolute maximum range is attained with the "pulsed thrust" program, 
which in fact constitutes the (¥,7) extremal that we mentioned before. This 
program amounts to the following: the aircraft starts from the initial state 
(™, Vo) with its engine off (B=0) or alternatively with maximum thrust 
B=Bmax until it reaches some curve V=V°(m)in the (m, V) plane. After that, 
the optimal program reduces to an alternating succession of powered 
sections with some optimal thrust P(f)=const and coasting sections with cut 
engines (B=0, m=const). The engine is switched on whenever the 
coasting velocity has dropped to V=V°(m). The engine switching frequency 
should be as high as possible. The higher the switching frequency, the 
deeper is the maximum. This program is continued until the aircraft 
reaches the line B=0 (m=const) or B=Bmax passing through the point(m:, V;) 
for V°(m,)>V; orVo(m1)<Vi, respectively. 
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3.1. Statement of the problem 


The equations of motion of a jet aircraft in horizontal rectilinear motion 
are 


Y —mg,=0; t 
és (S. 166) 
m= —B; | 
x=V, | 


where Mis the aircraft mass, g; is the gravitational acceleration, x is the 
horizontal coordinate of the aircraft, V is the aircraft velocity, A is the 
altitude of horizontal flight, X is the drag, Y is the lift, ya and m=—+”, 
We adopt the following hypotheses regarding the forces entering (S,. 166): 
1. The aerodynamic forcesX and Y are independent of the aircraft 
acceleration (the aerodynamic lag is ignored). Since the altitude is 
constant, X and Y are functions of velocity only. X is furthermore a 


function of the lift Y. 
2. The thrust Pis a function of the velocity V and the per-second fuel 


consumption Bf. It may be written in the form 


P(V, B) =fi(V)f2(B), 


where f,(V) is the velocity characteristic of thrust — an arbitrary positive 
function; fe(B) is the fuel consumption characteristic of thrust — an increasing 
function, generally displaying the property f2(B) 1<0. 

=0 


The physical meaning of P(V, 0)is back-pressure: The dependence P(h) 
is of no consequence, since 


h=const. 


3. The gravitational acceleration g, is constant. Equations (S, 166) are 
in fact written assuming constant g and ignoring the effects of the Earth's 
spin, The aircraft in these equations is regarded as a point mass. 

4. The fuel consumption B may vary between 0 and Bmax. 

Let us formulate the boundary conditions. We assume that at the initial 
time t=0 the aircraft mass is mp and the velocity is Vo. The point of origin 
is chosen so that x(0)=0. 

At the end of the flight, ‘=f, we have 


m=m,, V=Vy, x=Xn 


where neither /; nor x, are fixed. Our problem is to find a system of 
functions x(t), V(t), m(z), B(t), Y(t) satisfying equations (S.116) which ensure 
a maximum range *, on the set of pentades 


[x(), V(t), m(t), B(t), Y(2)) 
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satisfying (S.166). We are dealing with five unknown functions and four 
equations, i.e., the system has one degree of freedom. The last equation 
in (S. 166) gives an expression for the functional to be maximized 


ty oO Mo 
a= Vdt= —| vat={ + dm. (S. 167) 
0 t, m, 


Eliminating Y and dt between the first three equations in (S. 166), we find 


eter [P(V, B)—X(V, m)). (S. 168) 


The problem thus reduces to maximizing the functional 


Mo 
t=( Yam (S. 167%) 
m, 8 
under the constraints 
ah ene ae 
Vs i mp PU, DX (m, VY (S. 168*) 
O<B< Bax’ (S. 169) 
Vim)=V,; V(m)=Vo. (S. 170) 


Problems of this kind were previously considered in§S.2, Here we 
have 


F(m, V, B=; (S.171) 
g(m, V, =~ IPV, 8)—X(m, V)]. (S,172) 
In virtue of the above properties of the function P(V, 8), 
0g 
<0 S.17 
38 < ( 3) 
for all B €[0, Bmax]. 


To finally reduce this problem to the form discussed in §S. 2, it suffices 
to replace $B with an auxiliary parameter 


p=2 4-1, (S.174) 
Inequalities (S. 169) then take the form 
lp|<1, (S. 175) 


and condition (S. 173) reduces to 


OR 
ae > O (s. 176) 
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For p=1, i.e., B=Bmax, the derivative V’=g(m, V, B) takes on the least 
admissible value at the point (m, V) of the phase plane, and for p=1, B=0 it 
takes the largest value: 


V'=g(m, V, 8) +00 for 6+0, (S.177) 


Physically (S.177) signifies the fact that the limit value B=0 corresponds 
to flight with constant mass (m= const), i. e., motionof the representing point 
along the vertical in the phase plane (m, V). The 
latter factor signifies that the independent variable 
m has the properties of the auxiliary variable t in- 
troduced in § S. 3(constancy onthe limit direction B=0). 

Let us now establish the boundary of the region 
B of the physically admissible values of the 
function V(m). The boundary is made up of 
pieces of the following lines (Figure S, 3): 

Line 1: mgr—Y(V, amax)=0, where dmax is the 
maximum admissible angle of attack. This line 
is the lower limit for the admissible values of 
the function V(m), 

Lines 2 and 3: the vertical segments m=mmpo 
FIGURE S. 3 and m=m,, 

Lines 4 and 5: lines of flight with maximum 
thrust B=fmax passing through the points (7, Vo) and 
(m,, V,) which give, respectively, the upper and the lower limit of the ad- 
missible values of V(m). 


~ = 
™, mg ™ 


3.2. Optimal control program 


Let us first determine the function S(m, V, q). Here V(m)is the zero 
closeness function of the sough extremal, g(m) is the local value of the fuel 
consumption fp. Since the independent variable mis also the parameter, 
we may use expression (S. 49) for §. Thus, 


Vv 

S=F—Wegi(m, V, 2)—{ W,(m, t) dk; (S. 178) 

F(m,V,0) 

W = Lim, V0) 
ER (S. 179) 

Using (S.171) and (S.172), we may write 
W (m, Vaan _, -(S, 180) 

x(m,V) 
x(m, V)=X—P(V, 0); (S. 181) 


S(m, V, aL 4B PV, g)—X (m, V)I— 


Vv 
— J Wa(m, &) db, 
¢ 
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or, using (S. 181), 


Vv 
_ VAM) IG) mo) og 
a x(m,V) Wega prea g (S. 182) 
c 
where 
j= POLLO (3, 183) 


For an ideal liquid-propellant engine, j is independent of 8 and gives 
the ideal nozzle velocity. 

Let us find the pair of functions V(m), g(m) on which S(m, V, q) attains 
its absolute minimum over the set of admissible values of V and q for 
every fixed mC [m,, mt]. 

Writing the thrust in the form 


P(V, B)=fi(V)f2(B), 


we can devise an attractively simple method of solution of the last problem. 
In fact, the point g=gon which S attains its maximum for any m, VC Bis 
independent of m and V coincides with the maximum of the function /(B) on 
0<B<Bmax. Figures 4a and 4b show two alternative fuel consumption 
characteristics of the thrust program. Draw a family of imaginary rays 
from the point (0, /2(0)) which have at least one common point with the curve 
of fe(B) on the semi-interval(0, Bmax). In this family, we select a ray y which 
makes the largest slope angle to the horizontal axis. The abscissas qi,..., 4 
of the intersection points of this ray with the curve f2(B) on the semi-interval 
(0, Bmax) give the maximum of j(f), i.e., S, as we see from (S.183). Thus, 
if P(V, B) is representable in the form of a product f,(V)f2(B), the optimal 


value ®B=8 is independent of mand V. 


f2(p) Sotp) 


f 7 q Bmax b Bmax=q P 


FIGURE S.4 


The optimal dependence V°= V°(m) is specified by the condition of 
absolute maximum of the function of single variable S(m, jmax, V°) in B for 
every fixed m. As a necessary condition, it should consist of pieces of 
the boundary of B and of continuous pieces satisfying the finite equation. 


Saf — £1) Jax ~|aar a] =0 (S. 184) 
VULXWV, my—P(V.0 ly LXV, m)—PWV,9) Sn 
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which are joined by vertical segments at the points m=yp;(i=1, 2, ..., r) where 
S(t, J max: V, (u))=S (ui, J max» Vo(p1)), (S. 185) 


V,(m) and V2(m) being two solutions of (S. 184). 

Equation (S. 184) coincides with the equation W=0, the so-called 
"singular curve" from /14/, i.e., it coincides with Euler's degenerate 
equation for the case of linear dependence of thrust on fuel consumption, 
if we take P(V, 0)=0, f1(V)=1 and jmax=V2=const, the effective nozzle 
velocity, is assumed constant. Here the first two conditions are the 
simplifying assumptions adopted in /14/. Consequently, we can apply the 
results of /14/ for qualitative estimate of the zero closeness function and 
the optimal V=V°(m). In other words, all the properties of the extremal 
V(m) investigated in detail in /14/ are applicable: 

1) Equation (S. 184) has two solutions: supersonic V=V9(m)>a and sub- 
sonic V=V$(m)<a, where @ is the velocity 
of sound at altitude A. Along both solutions 
“ >0 (motion with decreasing velocity), and 
V$(m) invariably passes through the origin. 

2) The optimal flight program first follows 
the line V,(m) up to the point m=p, where 
condition (S.185) is satisfied. Then it changes 
to coasting along the vertical m=wp until it 
reaches the line V2(m), and then it proceeds 
along the line V2(m). 

/ The motion from the point (1%, Vo) to the 
FIGURE S.5 solution of (S. 184) and from the latter to the 
point (m,, V,) proceeds along the appropriate 
piece of the boundary of 8, i.e., either with 
f=0 or with B=Bmax, according as the points (mp, Vo) and (m;, V,) are 
respectively located below or above the line V°=/°(m) (Figure S. 5). 

According to our results, the entire optimal function V=V°(m) is deter- 
mind (with necessity and sufficiency) by the condition of the absolute 
maximum of the function of a single variable S(m, jmax, V) in B for every 
fixed m. ‘ 

We have thus obtained a local value of the fuel consumption g on the 
inclined sections, which is independent of m and V and corresponds to the 
maximum value of j/. We have also obtained the function V°(m) which 
coincides with the optimal velocity V(m)if constant nozzle velocity jmax is 
assumed. 

By Theorem 2, the pair V(m), §(m) defines the absolute (y, g) minimal 
iz C Uy if it satisfies the inequalities 


™y M My ™ 


# gm, 0, N20; (S188) 
V (p+0)—V(p—0)>0, (S. 187) 


where uw is the point of discontinuity of V(m). Inequality (S. 186) is always 
satisfied, since the results of /14/ show that if the minimal V(m) contains 
pieces of both branches, V,(m) and V2(m), motion always proceeds first 
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along the supersonic branch V;(m) and then along the subsonic branch Vo(m) . 
It thus remains to check (S. 187). 


As we have mentioned before, a >0 along both extremal branches. On 


the other hand, according to (S.172), g<0 whenever 
P(V, B) >X. (S. 188) 


The last inequality always holds true, i.e., the maximum j as a rule 
corresponds to a thrust which is greater than the drag. In this case, (S, 186) 
is naturally satisfied. The drag X increases with increasing V and m, so 
that (S.188) may fail, if at all, on the supersonic branch V\(m). But on this 


branch ~ also reaches its maximum value (see /14/), and (S. 186) is satis- 
m 


fied even if (S.188) does not apply. 

The pair V(m), @(m) thus constitutes the sought solution of the problem. 
It defines a (V, 7) line uw € Up on which the range has its absolute maximum. 

Let us examine the physical meaning of this (V, ¢7)maximal @. 

Each approximating polygonal line y, € U represents motion with periodic 
thrust switching. The inclined sections of the polygonal line yn correspond 
to powered flight with fuel consumption B=@(m;), and the vertical sections 
represent coasting with the engine cut. 

The engine is always switched on as soon as the coasting velocity with 
m==m,;reaches the value V°(m;), and it is cut off when the mass drops to 441. 
The engine is switched on a total of n times. 

We refer to this program as "pulsed thrust'’ program with frequency 7, 
mean velocity V°(m), and power thrust q. The (V, q) line uC Uo represents 
a pulsed thrust program of infinite frequency. 

Let us now consider the extremal for various typical particular cases. 

I. There is a finite number of values g; (i=1,2,...,%) satisfying the 
condition j(B) =jmax (a finite number of intersection points between the ray y 
and the characteristic f2(B) on (0, Bmax)). All 9; satisfy inequality (S. 186) 
everywhere on [m,, mo]. 

The function B°(m) defined by the equation 


dV0(m) _ beak ~ 
m= a [P(V, 8)—X(V°, m)] (S. 189) 


continuously varies with m. Since, on the other hand, any of the optimal 
values q=f:=const, we have g(m, V, aye Hence, the optimal program 
dm 


is a (V, q) line in the (m, V) plane, i.e., the ''pulsed thrust" program of 
infinite frequency. The representing point in the (m, V) plane should 
therefore move from position (mp, Vo) along the boundary of B(p=0or B=Bmax) 
until it reaches the zero closeness line V=f°(m), 

After that, the "pulsed thrust" program begins with maximum per- 
missible frequency. The fuel consumption on the powered sections should be 
equal to any one of the g; values, and the mean velocity should coincide with 
the optimal function V=V°(m). This program should be continued until the 
representing point again reaches the boundary. Then the motion continues 
along the boundary until the terminal position (m,, Vi) is reached. 
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ll. There is a finite number of values @;, i=1,2,..., &. 

Some of them satisfy inequality (S.186), whereas others do not satisfy 
this inequality. The sought optimal is a "pulsed thrust" program, similar 
to that described under I. The fuel consumption on the powered sections 
should be equal to any of the B; satisfying (S. 184). 

Ill. There is a continuum of values 


0<P, <8: < Bray 


for which j(q) =jmax (the ray y has a common segment with the thrust 
characteristic fo(8), Figure S.6). Part of the segment [fo, B,] satisfies in- 
equality (S. 188), where the other part does not necessarily satisfy this 
inequality. 


ia 
pmax =r 


f2(0) 


FIGURE S.6 


The particular case Bo=0, Br = Bmax here corresponds to a linear thrust 
characteristic. In this case, infinitely many optimal solutions exist. They 
correspond to "pulsed thrust" programs with infinite switching frequency 
and a common zero closeness line V=V°(jmax, ™), which are analogous to 
those described under I, but differing in that they may have any dependence 
q(m) € [Bo, Br] 28 long as it satisfies inequality (S,188). As a particular 
case, we may even take a dependence q=f°(m) which converts the in- 
equality in (S. 188) into equality. The "pulsed thrust" program then 
degenerates into Hibbs's continuous control /15/ originally derived for a 
linear characteristic P(B), and the optimal function V(m) coincides with the 
zero closeness line P°(m). 

The physical meaning of the multiple solutions in this case is that the 
range is affected only by the mean velocity V°(m), and is insensitive to 
local deviations from the mean. 

However, as the switching frequency is of necessity finite in practice, 
a single minimal should be selected among the infinitely many optimals, on 
which the range attains a strict maximum. This minimal coincides with 
Hibbs's continuous thrust control characteristic. 

In mathematical terms, the infinity of solutions implies that j= const on 
[Bo, B:] and therefore the function S(m, j, V) is independent of g, being a 
function of a single variable V for every fixed m. 
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